-
Ultrasensitive Magnetometer based on Cusp Points of the Photon-Magnon Synchronization Mode
Authors:
Xinlin Mi,
Jinwei Rao,
Lijun Yan,
Xudong Wang,
Bingbing Lyu,
Bimu Yao,
Shishen Yan,
Lihui Bai
Abstract:
Ultrasensitive magnetometers based on spin resonances have led to remarkable achievements. However, the gyromagnetic ratios of these spin resonances that determine the responsivity of magnetometers to weak magnetic fields are inherently constrained by the Land$\acute{e}$ g-factor of particles, such as the electron, with a constant gyromagnetic ratio of $γ_e=2π\times28$ GHz/T. Here, we demonstrate…
▽ More
Ultrasensitive magnetometers based on spin resonances have led to remarkable achievements. However, the gyromagnetic ratios of these spin resonances that determine the responsivity of magnetometers to weak magnetic fields are inherently constrained by the Land$\acute{e}$ g-factor of particles, such as the electron, with a constant gyromagnetic ratio of $γ_e=2π\times28$ GHz/T. Here, we demonstrate an ultrasensitive magnetometer based on the cusp point (CP) of photon-magnon synchronization modes (PMSMs). The PMSM's gyromagnetic ratio at the CP is enhanced to $37γ_e$ and further amplified to $236γ_e$ by utilizing the sixth-order oscillating mode of the PMSM. Moreover, the emission linewidth of the PMSM can be reduced to 0.06 Hz, resulting in excellent sensitivity to weak magnetic fields. These outstanding properties position our magnetometer to potentially achieve superior sensitivity to conventional magnetometers. Our work introduces a cost-effective prototype for the next generation of magnetometry, and may advance scientific research and technologies that rely on ultrasensitive magnetic field detection.
△ Less
Submitted 8 July, 2025;
originally announced July 2025.
-
MODS: Multi-source Observations Conditional Diffusion Model for Meteorological State Downscaling
Authors:
Siwei Tu,
Jingyi Xu,
Weidong Yang,
Lei Bai,
Ben Fei
Abstract:
Accurate acquisition of high-resolution surface meteorological conditions is critical for forecasting and simulating meteorological variables. Directly applying spatial interpolation methods to derive meteorological values at specific locations from low-resolution grid fields often yields results that deviate significantly from the actual conditions. Existing downscaling methods primarily rely on…
▽ More
Accurate acquisition of high-resolution surface meteorological conditions is critical for forecasting and simulating meteorological variables. Directly applying spatial interpolation methods to derive meteorological values at specific locations from low-resolution grid fields often yields results that deviate significantly from the actual conditions. Existing downscaling methods primarily rely on the coupling relationship between geostationary satellites and ERA5 variables as a condition. However, using brightness temperature data from geostationary satellites alone fails to comprehensively capture all the changes in meteorological variables in ERA5 maps. To address this limitation, we can use a wider range of satellite data to make more full use of its inversion effects on various meteorological variables, thus producing more realistic results across different meteorological variables. To further improve the accuracy of downscaling meteorological variables at any location, we propose the Multi-source Observation Down-Scaling Model (MODS). It is a conditional diffusion model that fuses data from multiple geostationary satellites GridSat, polar-orbiting satellites (AMSU-A, HIRS, and MHS), and topographic data (GEBCO), as conditions, and is pre-trained on the ERA5 reanalysis dataset. During training, latent features from diverse conditional inputs are extracted separately and fused into ERA5 maps via a multi-source cross-attention module. By exploiting the inversion relationships between reanalysis data and multi-source atmospheric variables, MODS generates atmospheric states that align more closely with real-world conditions. During sampling, MODS enhances downscaling consistency by incorporating low-resolution ERA5 maps and station-level meteorological data as guidance. Experimental results demonstrate that MODS achieves higher fidelity when downscaling ERA5 maps to a 6.25 km resolution.
△ Less
Submitted 2 June, 2025;
originally announced June 2025.
-
Fast Non-Line-of-Sight Transient Data Simulation and an Open Benchmark Dataset
Authors:
Yingjie Shi,
Jinye Miao,
Taotao Qin,
Fuyao Cai,
Yi Wei,
Lingfeng Liu,
Tongyao Li,
Chenyang Wu,
Huan Liang,
Yuyang Yin,
Lianfa Bai,
Enlai Guo,
Jing Han
Abstract:
Non-Line-of-Sight (NLOS) imaging reconstructs the shape and depth of hidden objects from picosecond-resolved transient signals, offering potential applications in autonomous driving, security, and medical diagnostics. However, current NLOS experiments rely on expensive hardware and complex system alignment, limiting their scalability. This manuscript presents a simplified simulation method that ge…
▽ More
Non-Line-of-Sight (NLOS) imaging reconstructs the shape and depth of hidden objects from picosecond-resolved transient signals, offering potential applications in autonomous driving, security, and medical diagnostics. However, current NLOS experiments rely on expensive hardware and complex system alignment, limiting their scalability. This manuscript presents a simplified simulation method that generates NLOS transient data by modeling light-intensity transport rather than performing conventional path tracing, significantly enhancing computational efficiency. All scene elements, including the relay surface, hidden target, stand-off distance, detector time resolution, and acquisition window are fully parameterized, allowing for rapid configuration of test scenarios. Reconstructions based on the simulated data accurately recover hidden geometries, validating the effectiveness of the approach. The proposed tool reduces the entry barrier for NLOS research and supports the optimization of system design.
△ Less
Submitted 4 June, 2025;
originally announced June 2025.
-
Scaling Physical Reasoning with the PHYSICS Dataset
Authors:
Shenghe Zheng,
Qianjia Cheng,
Junchi Yao,
Mengsong Wu,
Haonan He,
Ning Ding,
Yu Cheng,
Shuyue Hu,
Lei Bai,
Dongzhan Zhou,
Ganqu Cui,
Peng Ye
Abstract:
Large Language Models (LLMs) have achieved remarkable progress on advanced reasoning tasks such as mathematics and coding competitions. Meanwhile, physics, despite being both reasoning-intensive and essential to real-world understanding, received limited academic and industrial attention. This paper introduces PHYSICS, a dataset containing 16,568 high-quality physics problems spanning subjects and…
▽ More
Large Language Models (LLMs) have achieved remarkable progress on advanced reasoning tasks such as mathematics and coding competitions. Meanwhile, physics, despite being both reasoning-intensive and essential to real-world understanding, received limited academic and industrial attention. This paper introduces PHYSICS, a dataset containing 16,568 high-quality physics problems spanning subjects and difficulty levels, to facilitate this issue. Specifically, PHYSICS is curated with exercises from over 100 textbooks through a carefully designed pipeline for quality control. It covers five major physics domains: Mechanics, Electromagnetism, Thermodynamics, Optics, and Modern Physics. It also spans a wide range of difficulty levels, from high school to graduate-level physics courses. To utilize the data for improving and evaluating the model's physical reasoning capabilities, we split the dataset into training and test sets, and provide reasoning paths generated by powerful reasoning models for the training data to facilitate model training. In addition, for the evaluation part, we find that existing evaluation frameworks exhibit biases in aspects such as units, simplification, and precision in physics domain. To balance efficiency and accuracy, we introduce a Rule+Model evaluation framework tailored to physics problems. Our evaluations on current state-of-the-art open-source and proprietary models highlight the limitations of current models in handling physics-related tasks. We hope that our dataset and evaluation methodology will jointly advance the development of LLMs in the field of physics.
△ Less
Submitted 2 June, 2025; v1 submitted 21 May, 2025;
originally announced June 2025.
-
Align-DA: Align Score-based Atmospheric Data Assimilation with Multiple Preferences
Authors:
Jing-An Sun,
Hang Fan,
Junchao Gong,
Ben Fei,
Kun Chen,
Fenghua Ling,
Wenlong Zhang,
Wanghan Xu,
Li Yan,
Pierre Gentine,
Lei Bai
Abstract:
Data assimilation (DA) aims to estimate the full state of a dynamical system by combining partial and noisy observations with a prior model forecast, commonly referred to as the background. In atmospheric applications, this problem is fundamentally ill-posed due to the sparsity of observations relative to the high-dimensional state space. Traditional methods address this challenge by simplifying b…
▽ More
Data assimilation (DA) aims to estimate the full state of a dynamical system by combining partial and noisy observations with a prior model forecast, commonly referred to as the background. In atmospheric applications, this problem is fundamentally ill-posed due to the sparsity of observations relative to the high-dimensional state space. Traditional methods address this challenge by simplifying background priors to regularize the solution, which are empirical and require continual tuning for application. Inspired by alignment techniques in text-to-image diffusion models, we propose Align-DA, which formulates DA as a generative process and uses reward signals to guide background priors, replacing manual tuning with data-driven alignment. Specifically, we train a score-based model in the latent space to approximate the background-conditioned prior, and align it using three complementary reward signals for DA: (1) assimilation accuracy, (2) forecast skill initialized from the assimilated state, and (3) physical adherence of the analysis fields. Experiments with multiple reward signals demonstrate consistent improvements in analysis quality across different evaluation metrics and observation-guidance strategies. These results show that preference alignment, implemented as a soft constraint, can automatically adapt complex background priors tailored to DA, offering a promising new direction for advancing the field.
△ Less
Submitted 28 May, 2025;
originally announced May 2025.
-
Deep Reparameterization for Full Waveform Inversion: Architecture Benchmarking, Robust Inversion, and Multiphysics Extension
Authors:
Feng Liu,
Yaxing Li,
Rui Su,
Jianping Huang,
Lei Bai
Abstract:
Full waveform inversion (FWI) is a high-resolution subsurface imaging technique, but its effectiveness is limited by challenges such as noise contamination, sparse acquisition, and artifacts from multiparameter coupling. To address these limitations, this study develops a deep reparameterized FWI (DR-FWI) framework, in which subsurface parameters are represented by a deep neural network. Instead o…
▽ More
Full waveform inversion (FWI) is a high-resolution subsurface imaging technique, but its effectiveness is limited by challenges such as noise contamination, sparse acquisition, and artifacts from multiparameter coupling. To address these limitations, this study develops a deep reparameterized FWI (DR-FWI) framework, in which subsurface parameters are represented by a deep neural network. Instead of directly optimizing the parameters, DR-FWI optimizes the network weights to reconstruct them, thereby embedding structural priors and facilitating optimization. To provide benchmark guidelines for the design of DR-FWI, we conduct a comparative analysis of three representative architectures (U-Net, CNN, MLP) combined with two initial model embedding strategies: one pretraining the network to generate predefined initial models (pretraining-based), while the other directly adds network outputs to the initial models. Extensive ablation experiments show that combining CNN with pretraining-based initialization significantly enhances inversion accuracy, offering valuable insights into network design. To further understand the mechanism of DR-FWI, spectral bias analysis reveals that the network first captures low-frequency features and gradually reconstructs high-frequency details, enabling an adaptive multi-scale inversion strategy. Notably, the robustness of DR-FWI is validated under various noise levels and sparse acquisition scenarios, where its strong performance with limited shots and receivers demonstrates reduced reliance on dense observational data. Additionally, a backbone-branch structure is proposed to extend DR-FWI to multiparameter inversion, and its efficacy in mitigating cross-parameter interference is validated on a synthetic anomaly model and the Marmousi2 model. These results suggest a promising direction for joint inversion involving multiple parameters or multiphysics.
△ Less
Submitted 21 June, 2025; v1 submitted 24 April, 2025;
originally announced April 2025.
-
Satellite Observations Guided Diffusion Model for Accurate Meteorological States at Arbitrary Resolution
Authors:
Siwei Tu,
Ben Fei,
Weidong Yang,
Fenghua Ling,
Hao Chen,
Zili Liu,
Kun Chen,
Hang Fan,
Wanli Ouyang,
Lei Bai
Abstract:
Accurate acquisition of surface meteorological conditions at arbitrary locations holds significant importance for weather forecasting and climate simulation. Due to the fact that meteorological states derived from satellite observations are often provided in the form of low-resolution grid fields, the direct application of spatial interpolation to obtain meteorological states for specific location…
▽ More
Accurate acquisition of surface meteorological conditions at arbitrary locations holds significant importance for weather forecasting and climate simulation. Due to the fact that meteorological states derived from satellite observations are often provided in the form of low-resolution grid fields, the direct application of spatial interpolation to obtain meteorological states for specific locations often results in significant discrepancies when compared to actual observations. Existing downscaling methods for acquiring meteorological state information at higher resolutions commonly overlook the correlation with satellite observations. To bridge the gap, we propose Satellite-observations Guided Diffusion Model (SGD), a conditional diffusion model pre-trained on ERA5 reanalysis data with satellite observations (GridSat) as conditions, which is employed for sampling downscaled meteorological states through a zero-shot guided sampling strategy and patch-based methods. During the training process, we propose to fuse the information from GridSat satellite observations into ERA5 maps via the attention mechanism, enabling SGD to generate atmospheric states that align more accurately with actual conditions. In the sampling, we employed optimizable convolutional kernels to simulate the upscale process, thereby generating high-resolution ERA5 maps using low-resolution ERA5 maps as well as observations from weather stations as guidance. Moreover, our devised patch-based method promotes SGD to generate meteorological states at arbitrary resolutions. Experiments demonstrate SGD fulfills accurate meteorological states downscaling to 6.25km.
△ Less
Submitted 8 February, 2025;
originally announced February 2025.
-
Physically Consistent Global Atmospheric Data Assimilation with Machine Learning in Latent Space
Authors:
Hang Fan,
Lei Bai,
Ben Fei,
Yi Xiao,
Kun Chen,
Yubao Liu,
Yongquan Qu,
Fenghua Ling,
Pierre Gentine
Abstract:
Data assimilation (DA) integrates observations with model forecasts to produce optimized atmospheric states, whose physical consistency is critical for stable weather forecasting and reliable climate research. Traditional Bayesian DA methods enforce these nonlinear, flow-dependent physical constraints through empirical and tunable covariance structures, but with limited accuracy and robustness. He…
▽ More
Data assimilation (DA) integrates observations with model forecasts to produce optimized atmospheric states, whose physical consistency is critical for stable weather forecasting and reliable climate research. Traditional Bayesian DA methods enforce these nonlinear, flow-dependent physical constraints through empirical and tunable covariance structures, but with limited accuracy and robustness. Here, we introduce Latent Data Assimilation (LDA), a framework that performs Bayesian DA in a latent space learned from multivariate global atmospheric data via an autoencoder. We demonstrate that the autoencoder can largely capture nonlinear physical relationships, enabling LDA to produce balanced analyses without explicitly modeling physical constraints. Assimilation in latent space also improves both analysis quality and forecast skill compared to traditional model-space DA, under both idealized and real observational settings. Furthermore, LDA exhibits strong robustness across latent dimensions and remains effective even when the autoencoder is trained on inaccurate but physically realistic forecasts, highlighting its flexibility for real-world applications.
△ Less
Submitted 8 July, 2025; v1 submitted 4 February, 2025;
originally announced February 2025.
-
DispFormer: Pretrained Transformer for Flexible Dispersion Curve Inversion from Global Synthesis to Regional Applications
Authors:
Feng Liu,
Bao Deng,
Rui Su,
Lei Bai,
Wanli Ouyang
Abstract:
Surface wave dispersion curve inversion is essential for estimating subsurface Shear-wave velocity ($v_s$), yet traditional methods often struggle to balance computational efficiency with inversion accuracy. While deep learning approaches show promise, previous studies typically require large amounts of labeled data and struggle with real-world datasets that have varying period ranges, missing dat…
▽ More
Surface wave dispersion curve inversion is essential for estimating subsurface Shear-wave velocity ($v_s$), yet traditional methods often struggle to balance computational efficiency with inversion accuracy. While deep learning approaches show promise, previous studies typically require large amounts of labeled data and struggle with real-world datasets that have varying period ranges, missing data, and low signal-to-noise ratios. This study proposes DispFormer, a transformer-based neural network for inverting the $v_s$ profile from Rayleigh-wave phase and group dispersion curves. DispFormer processes dispersion data at each period independently, thereby allowing it to handle data of varying lengths without requiring network modifications or alignment between training and testing data. The performance is demonstrated by pre-training it on a global synthetic dataset and testing it on two regional synthetic datasets using zero-shot and few-shot strategies. Results indicate that zero-shot DispFormer, even without any labeled data, produces inversion profiles that match well with the ground truth, providing a deployable initial model generator to assist traditional methods. When labeled data is available, few-shot DispFormer outperforms traditional methods with only a small number of labels. Furthermore, real-world tests indicate that DispFormer effectively handles varying length data, and yields lower data residuals than reference models. These findings demonstrate that DispFormer provides a robust foundation model for dispersion curve inversion and is a promising approach for broader applications.
△ Less
Submitted 8 January, 2025;
originally announced January 2025.
-
LWFNet: Coherent Doppler Wind Lidar-Based Network for Wind Field Retrieval
Authors:
Ran Tao,
Chong Wang,
Hao Chen,
Mingjiao Jia,
Xiang Shang,
Luoyuan Qu,
Guoliang Shentu,
Yanyu Lu,
Yanfeng Huo,
Lei Bai,
Xianghui Xue,
Xiankang Dou
Abstract:
Accurate detection of wind fields within the troposphere is essential for atmospheric dynamics research and plays a crucial role in extreme weather forecasting. Coherent Doppler wind lidar (CDWL) is widely regarded as the most suitable technique for high spatial and temporal resolution wind field detection. However, since coherent detection relies heavily on the concentration of aerosol particles,…
▽ More
Accurate detection of wind fields within the troposphere is essential for atmospheric dynamics research and plays a crucial role in extreme weather forecasting. Coherent Doppler wind lidar (CDWL) is widely regarded as the most suitable technique for high spatial and temporal resolution wind field detection. However, since coherent detection relies heavily on the concentration of aerosol particles, which cause Mie scattering, the received backscattering lidar signal exhibits significantly low intensity at high altitudes. As a result, conventional methods, such as spectral centroid estimation, often fail to produce credible and accurate wind retrieval results in these regions. To address this issue, we propose LWFNet, the first Lidar-based Wind Field (WF) retrieval neural Network, built upon Transformer and the Kolmogorov-Arnold network. Our model is trained solely on targets derived from the traditional wind retrieval algorithm and utilizes radiosonde measurements as the ground truth for test results evaluation. Experimental results demonstrate that LWFNet not only extends the maximum wind field detection range but also produces more accurate results, exhibiting a level of precision that surpasses the labeled targets. This phenomenon, which we refer to as super-accuracy, is explored by investigating the potential underlying factors that contribute to this intriguing occurrence. In addition, we compare the performance of LWFNet with other state-of-the-art (SOTA) models, highlighting its superior effectiveness and capability in high-resolution wind retrieval. LWFNet demonstrates remarkable performance in lidar-based wind field retrieval, setting a benchmark for future research and advancing the development of deep learning models in this domain.
△ Less
Submitted 5 January, 2025;
originally announced January 2025.
-
CA-MoE: Channel-Adapted MoE for Incremental Weather Forecasting
Authors:
Hao Chen,
Han Tao,
Guo Song,
Jie Zhang,
Yunlong Yu,
Yonghan Dong,
Chuang Yang,
Lei Bai
Abstract:
Atmospheric science is intricately connected with other fields, e.g., geography and aerospace. Most existing approaches involve training a joint atmospheric and geographic model from scratch, which incurs significant computational costs and overlooks the potential for incremental learning of weather variables across different domains. In this paper, we introduce incremental learning to weather for…
▽ More
Atmospheric science is intricately connected with other fields, e.g., geography and aerospace. Most existing approaches involve training a joint atmospheric and geographic model from scratch, which incurs significant computational costs and overlooks the potential for incremental learning of weather variables across different domains. In this paper, we introduce incremental learning to weather forecasting and propose a novel structure that allows for the flexible expansion of variables within the model. Specifically, our method presents a Channel-Adapted MoE (CA-MoE) that employs a divide-and-conquer strategy. This strategy assigns variable training tasks to different experts by index embedding and reduces computational complexity through a channel-wise Top-K strategy. Experiments conducted on the widely utilized ERA5 dataset reveal that our method, utilizing only approximately 15\% of trainable parameters during the incremental stage, attains performance that is on par with state-of-the-art competitors. Notably, in the context of variable incremental experiments, our method demonstrates negligible issues with catastrophic forgetting.
△ Less
Submitted 3 December, 2024;
originally announced December 2024.
-
FengWu-W2S: A deep learning model for seamless weather-to-subseasonal forecast of global atmosphere
Authors:
Fenghua Ling,
Kang Chen,
Jiye Wu,
Tao Han,
Jing-Jia Luo,
Wanli Ouyang,
Lei Bai
Abstract:
Seamless forecasting that produces warning information at continuum timescales based on only one system is a long-standing pursuit for weather-climate service. While the rapid advancement of deep learning has induced revolutionary changes in classical forecasting field, current efforts are still focused on building separate AI models for weather and climate forecasts. To explore the seamless forec…
▽ More
Seamless forecasting that produces warning information at continuum timescales based on only one system is a long-standing pursuit for weather-climate service. While the rapid advancement of deep learning has induced revolutionary changes in classical forecasting field, current efforts are still focused on building separate AI models for weather and climate forecasts. To explore the seamless forecasting ability based on one AI model, we propose FengWu-Weather to Subseasonal (FengWu-W2S), which builds on the FengWu global weather forecast model and incorporates an ocean-atmosphere-land coupling structure along with a diverse perturbation strategy. FengWu-W2S can generate 6-hourly atmosphere forecasts extending up to 42 days through an autoregressive and seamless manner. Our hindcast results demonstrate that FengWu-W2S reliably predicts atmospheric conditions out to 3-6 weeks ahead, enhancing predictive capabilities for global surface air temperature, precipitation, geopotential height and intraseasonal signals such as the Madden-Julian Oscillation (MJO) and North Atlantic Oscillation (NAO). Moreover, our ablation experiments on forecast error growth from daily to seasonal timescales reveal potential pathways for developing AI-based integrated system for seamless weather-climate forecasting in the future.
△ Less
Submitted 19 November, 2024; v1 submitted 15 November, 2024;
originally announced November 2024.
-
WeatherGFM: Learning A Weather Generalist Foundation Model via In-context Learning
Authors:
Xiangyu Zhao,
Zhiwang Zhou,
Wenlong Zhang,
Yihao Liu,
Xiangyu Chen,
Junchao Gong,
Hao Chen,
Ben Fei,
Shiqi Chen,
Wanli Ouyang,
Xiao-Ming Wu,
Lei Bai
Abstract:
The Earth's weather system encompasses intricate weather data modalities and diverse weather understanding tasks, which hold significant value to human life. Existing data-driven models focus on single weather understanding tasks (e.g., weather forecasting). Although these models have achieved promising results, they fail to tackle various complex tasks within a single and unified model. Moreover,…
▽ More
The Earth's weather system encompasses intricate weather data modalities and diverse weather understanding tasks, which hold significant value to human life. Existing data-driven models focus on single weather understanding tasks (e.g., weather forecasting). Although these models have achieved promising results, they fail to tackle various complex tasks within a single and unified model. Moreover, the paradigm that relies on limited real observations for a single scenario hinders the model's performance upper bound. In response to these limitations, we draw inspiration from the in-context learning paradigm employed in state-of-the-art visual foundation models and large language models. In this paper, we introduce the first generalist weather foundation model (WeatherGFM), designed to address a wide spectrum of weather understanding tasks in a unified manner. More specifically, we initially unify the representation and definition of the diverse weather understanding tasks. Subsequently, we devised weather prompt formats to manage different weather data modalities, namely single, multiple, and temporal modalities. Finally, we adopt a visual prompting question-answering paradigm for the training of unified weather understanding tasks. Extensive experiments indicate that our WeatherGFM can effectively handle up to ten weather understanding tasks, including weather forecasting, super-resolution, weather image translation, and post-processing. Our method also showcases generalization ability on unseen tasks.
△ Less
Submitted 8 December, 2024; v1 submitted 8 November, 2024;
originally announced November 2024.
-
SIFM: A Foundation Model for Multi-granularity Arctic Sea Ice Forecasting
Authors:
Jingyi Xu,
Yeqi Luo,
Weidong Yang,
Keyi Liu,
Shengnan Wang,
Ben Fei,
Lei Bai
Abstract:
Arctic sea ice performs a vital role in global climate and has paramount impacts on both polar ecosystems and coastal communities. In the last few years, multiple deep learning based pan-Arctic sea ice concentration (SIC) forecasting methods have emerged and showcased superior performance over physics-based dynamical models. However, previous methods forecast SIC at a fixed temporal granularity, e…
▽ More
Arctic sea ice performs a vital role in global climate and has paramount impacts on both polar ecosystems and coastal communities. In the last few years, multiple deep learning based pan-Arctic sea ice concentration (SIC) forecasting methods have emerged and showcased superior performance over physics-based dynamical models. However, previous methods forecast SIC at a fixed temporal granularity, e.g. sub-seasonal or seasonal, thus only leveraging inter-granularity information and overlooking the plentiful inter-granularity correlations. SIC at various temporal granularities exhibits cumulative effects and are naturally consistent, with short-term fluctuations potentially impacting long-term trends and long-term trends provides effective hints for facilitating short-term forecasts in Arctic sea ice. Therefore, in this study, we propose to cultivate temporal multi-granularity that naturally derived from Arctic sea ice reanalysis data and provide a unified perspective for modeling SIC via our Sea Ice Foundation Model. SIFM is delicately designed to leverage both intra-granularity and inter-granularity information for capturing granularity-consistent representations that promote forecasting skills. Our extensive experiments show that SIFM outperforms off-the-shelf deep learning models for their specific temporal granularity.
△ Less
Submitted 16 October, 2024;
originally announced October 2024.
-
IceDiff: High Resolution and High-Quality Sea Ice Forecasting with Generative Diffusion Prior
Authors:
Jingyi Xu,
Siwei Tu,
Weidong Yang,
Shuhao Li,
Keyi Liu,
Yeqi Luo,
Lipeng Ma,
Ben Fei,
Lei Bai
Abstract:
Variation of Arctic sea ice has significant impacts on polar ecosystems, transporting routes, coastal communities, and global climate. Tracing the change of sea ice at a finer scale is paramount for both operational applications and scientific studies. Recent pan-Arctic sea ice forecasting methods that leverage advances in artificial intelligence has made promising progress over numerical models.…
▽ More
Variation of Arctic sea ice has significant impacts on polar ecosystems, transporting routes, coastal communities, and global climate. Tracing the change of sea ice at a finer scale is paramount for both operational applications and scientific studies. Recent pan-Arctic sea ice forecasting methods that leverage advances in artificial intelligence has made promising progress over numerical models. However, forecasting sea ice at higher resolutions is still under-explored. To bridge the gap, we propose a two-staged deep learning framework, IceDiff, to forecast sea ice concentration at finer scales. IceDiff first leverages an independently trained vision transformer to generate coarse yet superior forecasting over previous methods at a regular 25km x 25km grid. This high-quality sea ice forecasting can be utilized as reliable guidance for the next stage. Subsequently, an unconditional diffusion model pre-trained on sea ice concentration maps is utilized for sampling down-scaled sea ice forecasting via a zero-shot guided sampling strategy and a patch-based method. For the first time, IceDiff demonstrates sea ice forecasting with the 6.25km x 6.25km resolution. IceDiff extends the boundary of existing sea ice forecasting models and more importantly, its capability to generate high-resolution sea ice concentration data is vital for pragmatic usages and research.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Detecting collagen by machine learning improved photoacoustic spectral analysis for breast cancer diagnostics: feasibility studies with murine models
Authors:
Jiayan Li,
Lu Bai,
Yingna Chen,
Junmei Cao,
Jingtao Zhu,
Wanxiang Zhi,
Qian Cheng
Abstract:
Collagen, a key structural component of the extracellular matrix, undergoes significant remodeling during carcinogenesis. However, the important role of collagen levels in breast cancer diagnostics still lacks effective in vivo detection techniques to provide a deeper understanding. This study presents photoacoustic spectral analysis improved by machine learning as a promising non-invasive diagnos…
▽ More
Collagen, a key structural component of the extracellular matrix, undergoes significant remodeling during carcinogenesis. However, the important role of collagen levels in breast cancer diagnostics still lacks effective in vivo detection techniques to provide a deeper understanding. This study presents photoacoustic spectral analysis improved by machine learning as a promising non-invasive diagnostic method, focusing on exploring collagen as a salient biomarker. Murine model experiments revealed more profound associations of collagen with other cancer components than in normal tissues. Moreover, an optimal set of feature wavelengths was identified by a genetic algorithm for enhanced diagnostic performance, among which 75% were from collagen-dominated absorption wavebands. Using optimal spectra, the diagnostic algorithm achieved 72% accuracy, 66% sensitivity, and 78% specificity, surpassing full-range spectra by 6%, 4%, and 8%, respectively. The proposed photoacoustic methods examine the feasibility of offering valuable biochemical insights into existing techniques, showing great potential for early-stage cancer detection.
△ Less
Submitted 10 October, 2024;
originally announced October 2024.
-
Longitudinal photoacoustic monitoring of collagen evolution modulated by cancer-associated fibroblasts: simulation and experiment studies
Authors:
Jiayan Li,
Lu Bai,
Junmei Cao,
Wenxiang Zhi,
Qian Cheng
Abstract:
Noninvasive in vivo detection of collagen facilitates the investigation of mechanisms by which cancer-associated fibroblast (CAF) regulates the extracellular matrix. This study explored the feasibility of photoacoustic spectrum analysis (PASA) in identifying longitudinal changes of collagen modulated by CAFs using simulations and experiment studies. Optical and acoustic simulations in tissues were…
▽ More
Noninvasive in vivo detection of collagen facilitates the investigation of mechanisms by which cancer-associated fibroblast (CAF) regulates the extracellular matrix. This study explored the feasibility of photoacoustic spectrum analysis (PASA) in identifying longitudinal changes of collagen modulated by CAFs using simulations and experiment studies. Optical and acoustic simulations in tissues were performed based on the histological slides of maximum cross-sections of murine malignancies to verify the effectiveness of photoacoustic (PA) detection system and the parameter "relative area of power spectrum density (APSD)". Experiments were conducted on three groups of mouse models with incremental ratios of CAFs and breast cancer cells at 3 continuous time points. Results discovered that the system configuration and APSD were capable of reflecting the evolution of collagen during cancer growth. Furthermore, cancers receiving a high dose of CAFs exhibited a suppressed collagen level. The presented methods show great potential for clinical translation of PASA in the field of cancer therapies targeting CAFs.
△ Less
Submitted 4 October, 2024;
originally announced October 2024.
-
WeatherFormer: Empowering Global Numerical Weather Forecasting with Space-Time Transformer
Authors:
Junchao Gong,
Tao Han,
Kang Chen,
Lei Bai
Abstract:
Numerical Weather Prediction (NWP) system is an infrastructure that exerts considerable impacts on modern society.Traditional NWP system, however, resolves it by solving complex partial differential equations with a huge computing cluster, resulting in tons of carbon emission. Exploring efficient and eco-friendly solutions for NWP attracts interest from Artificial Intelligence (AI) and earth scien…
▽ More
Numerical Weather Prediction (NWP) system is an infrastructure that exerts considerable impacts on modern society.Traditional NWP system, however, resolves it by solving complex partial differential equations with a huge computing cluster, resulting in tons of carbon emission. Exploring efficient and eco-friendly solutions for NWP attracts interest from Artificial Intelligence (AI) and earth science communities. To narrow the performance gap between the AI-based methods and physic predictor, this work proposes a new transformer-based NWP framework, termed as WeatherFormer, to model the complex spatio-temporal atmosphere dynamics and empowering the capability of data-driven NWP. WeatherFormer innovatively introduces the space-time factorized transformer blocks to decrease the parameters and memory consumption, in which Position-aware Adaptive Fourier Neural Operator (PAFNO) is proposed for location sensible token mixing. Besides, two data augmentation strategies are utilized to boost the performance and decrease training consumption. Extensive experiments on WeatherBench dataset show WeatherFormer achieves superior performance over existing deep learning methods and further approaches the most advanced physical model.
△ Less
Submitted 21 September, 2024;
originally announced September 2024.
-
Tracing Rayleigh-Taylor instability from measured periodic modulation in laser driven proton beams
Authors:
Z. Liu,
M. K. Zhao,
P. L. Bai,
X. J. Yang,
R. Qi,
Y. Xu,
J. W. Wang,
Y. X. Leng,
J. H. Bin,
R. X. Li
Abstract:
Rayleigh-Taylor (RT) instability occurs in a variety of scenario as a consequence of fluids of different densities pushing against the density gradient. For example, it is expected to occur in the ion acceleration of solid density targets driven by high intensity lasers and is crucial for the acceleration process. Yet, it is essential to understand the dynamics of the RT instability, a typical way…
▽ More
Rayleigh-Taylor (RT) instability occurs in a variety of scenario as a consequence of fluids of different densities pushing against the density gradient. For example, it is expected to occur in the ion acceleration of solid density targets driven by high intensity lasers and is crucial for the acceleration process. Yet, it is essential to understand the dynamics of the RT instability, a typical way to measure this phenomenon requires sophisticated diagnostics such as streak X ray radiography. Here, we report on experimental observation on periodic modulation in the energy spectrum of laser accelerated proton beams. Interestingly, theoretical model and two-dimensional particle-in-cell simulations, in good agreement with the experimental finding, indicated that such modulation is associated with periodic modulated electron density induced by transverse Rayleigh-Taylor-like instability. Furthermore, the correlation between the RT instability and the ion acceleration provides an interpretation to trace the development of the RT instability from the modulated proton spectrum. Our results thus suggest a possible tool to diagnose the evolution of the RT instability, and may have implications for further understanding for the accelerating mechanisms as well as optimization strategies for laser driven ion acceleration.
△ Less
Submitted 23 September, 2024;
originally announced September 2024.
-
A Benchmark for AI-based Weather Data Assimilation
Authors:
Wuxin Wang,
Weicheng Ni,
Tao Han,
Taikang Yuan,
Xiaoyong Li,
Lei Bai,
Boheng Duan,
Kaijun Ren
Abstract:
Recent advancements in Artificial Intelligence (AI) have led to the development of several Large Weather Models (LWMs) that rival State-Of-The-Art (SOTA) Numerical Weather Prediction (NWP) systems. Until now, these models have still relied on traditional NWP-generated analysis fields as input and are far from autonomous. Currently, scientists are increasingly focusing on developing data-driven dat…
▽ More
Recent advancements in Artificial Intelligence (AI) have led to the development of several Large Weather Models (LWMs) that rival State-Of-The-Art (SOTA) Numerical Weather Prediction (NWP) systems. Until now, these models have still relied on traditional NWP-generated analysis fields as input and are far from autonomous. Currently, scientists are increasingly focusing on developing data-driven data assimilation (DA) models for LWMs. To expedite advancements in this field and facilitate the operationalization of data-driven end-to-end weather forecasting systems, we propose DABench, a benchmark constructed by simulated observations, real-world observations, and ERA5 reanalysis. DABench contributes four standard features: (1) sparse and noisy observations provided for both simulated and real-world experiments; (2) a Skillful pre-trained Transformer-based weather prediction model, Sformer, designed to generate background fields while rigorously assessing the impact of assimilation outcomes on predictions; (3) standardized evaluation metrics for the model comparison; (4) a strong DA baseline, 4DVarFormerV2. Our experimental results demonstrate that the end-to-end weather forecasting system, integrating 4DVarFormerV2 and Sformer, can assimilate real-world observations, thereby facilitating a stable DA cycle lasting one year and achieving a skillful forecasting lead time of up to 7 days. The proposed DABench will significantly advance research in AI-based DA, AI-based weather forecasting, and related domains.
△ Less
Submitted 29 October, 2024; v1 submitted 21 August, 2024;
originally announced August 2024.
-
MambaDS: Near-Surface Meteorological Field Downscaling with Topography Constrained Selective State Space Modeling
Authors:
Zili Liu,
Hao Chen,
Lei Bai,
Wenyuan Li,
Wanli Ouyang,
Zhengxia Zou,
Zhenwei Shi
Abstract:
In an era of frequent extreme weather and global warming, obtaining precise, fine-grained near-surface weather forecasts is increasingly essential for human activities. Downscaling (DS), a crucial task in meteorological forecasting, enables the reconstruction of high-resolution meteorological states for target regions from global-scale forecast results. Previous downscaling methods, inspired by CN…
▽ More
In an era of frequent extreme weather and global warming, obtaining precise, fine-grained near-surface weather forecasts is increasingly essential for human activities. Downscaling (DS), a crucial task in meteorological forecasting, enables the reconstruction of high-resolution meteorological states for target regions from global-scale forecast results. Previous downscaling methods, inspired by CNN and Transformer-based super-resolution models, lacked tailored designs for meteorology and encountered structural limitations. Notably, they failed to efficiently integrate topography, a crucial prior in the downscaling process. In this paper, we address these limitations by pioneering the selective state space model into the meteorological field downscaling and propose a novel model called MambaDS. This model enhances the utilization of multivariable correlations and topography information, unique challenges in the downscaling process while retaining the advantages of Mamba in long-range dependency modeling and linear computational complexity. Through extensive experiments in both China mainland and the continental United States (CONUS), we validated that our proposed MambaDS achieves state-of-the-art results in three different types of meteorological field downscaling settings. We will release the code subsequently.
△ Less
Submitted 20 August, 2024;
originally announced August 2024.
-
How far are today's time-series models from real-world weather forecasting applications?
Authors:
Tao Han,
Song Guo,
Zhenghao Chen,
Wanghan Xu,
Lei Bai
Abstract:
The development of Time-Series Forecasting (TSF) techniques is often hindered by the lack of comprehensive datasets. This is particularly problematic for time-series weather forecasting, where commonly used datasets suffer from significant limitations such as small size, limited temporal coverage, and sparse spatial distribution. These constraints severely impede the optimization and evaluation of…
▽ More
The development of Time-Series Forecasting (TSF) techniques is often hindered by the lack of comprehensive datasets. This is particularly problematic for time-series weather forecasting, where commonly used datasets suffer from significant limitations such as small size, limited temporal coverage, and sparse spatial distribution. These constraints severely impede the optimization and evaluation of TSF models, resulting in benchmarks that are not representative of real-world applications, such as operational weather forecasting. In this work, we introduce the WEATHER-5K dataset, a comprehensive collection of observational weather data that better reflects real-world scenarios. As a result, it enables a better training of models and a more accurate assessment of the real-world forecasting capabilities of TSF models, pushing them closer to in-situ applications. Through extensive benchmarking against operational Numerical Weather Prediction (NWP) models, we provide researchers with a clear assessment of the gap between academic TSF models and real-world weather forecasting applications. This highlights the significant performance disparity between TSF and NWP models by analyzing performance across detailed weather variables, extreme weather event prediction, and model complexity comparison. Finally, we summarise the result into recommendations to the users and highlight potential areas required to facilitate further TSF research. The dataset and benchmark implementation are available at: https://github.com/taohan10200/WEATHER-5K.
△ Less
Submitted 11 October, 2024; v1 submitted 20 June, 2024;
originally announced June 2024.
-
Beam shaping by nonlinear moiré metasurfaces
Authors:
Lun Qu,
Wei Wu,
Di Zhang,
Chenxiong Wang,
Lu Bai,
Chenyang Li,
Wei Cai,
Mengxin Ren,
Andrea Alù,
Jingjun Xu
Abstract:
This paper explores the interplay of momentum transfer and nonlinear optical processes through moiré phenomena. Momentum transfer plays a crucial role in the interaction between photons and matter. Here, we study stacked metasurfaces with tailored dispersion and rotated against each other with varying twisted angles. The stacking introduces interlayer interactions, which can be controlled by the r…
▽ More
This paper explores the interplay of momentum transfer and nonlinear optical processes through moiré phenomena. Momentum transfer plays a crucial role in the interaction between photons and matter. Here, we study stacked metasurfaces with tailored dispersion and rotated against each other with varying twisted angles. The stacking introduces interlayer interactions, which can be controlled by the relative angle between metasurfaces, significantly enriching the resulting response compared to the single layer counterpart. By focusing on second-harmonic generation (SHG) from these twisted metasurfaces, we delve into the realm of nonlinear moiré photonics. Through experimental observations, we unveil the emergence of intricate far-field SHG radiation patterns, showing their effective tuning by varying the twisted angles. These findings offer a fresh perspective to explore nonlinear wavefront shaping through moiré phenomena, opening new avenues for nonlinear information processing, optical steering, and nonlinear optical switching.
△ Less
Submitted 20 June, 2024;
originally announced June 2024.
-
Data-driven Global Ocean Modeling for Seasonal to Decadal Prediction
Authors:
Zijie Guo,
Pumeng Lyu,
Fenghua Ling,
Lei Bai,
Jing-Jia Luo,
Niklas Boers,
Toshio Yamagata,
Takeshi Izumo,
Sophie Cravatte,
Antonietta Capotondi,
Wanli Ouyang
Abstract:
Accurate ocean dynamics modeling is crucial for enhancing understanding of ocean circulation, predicting climate variability, and tackling challenges posed by climate change. Despite improvements in traditional numerical models, predicting global ocean variability over multi-year scales remains challenging. Here, we propose ORCA-DL (Oceanic Reliable foreCAst via Deep Learning), the first data-driv…
▽ More
Accurate ocean dynamics modeling is crucial for enhancing understanding of ocean circulation, predicting climate variability, and tackling challenges posed by climate change. Despite improvements in traditional numerical models, predicting global ocean variability over multi-year scales remains challenging. Here, we propose ORCA-DL (Oceanic Reliable foreCAst via Deep Learning), the first data-driven 3D ocean model for seasonal to decadal prediction of global ocean circulation. ORCA-DL accurately simulates three-dimensional ocean dynamics and outperforms state-of-the-art dynamical models in capturing extreme events, including El Niño-Southern Oscillation and upper ocean heatwaves. This demonstrates the high potential of data-driven models for efficient and accurate global ocean forecasting. Moreover, ORCA-DL stably emulates ocean dynamics at decadal timescales, demonstrating its potential even for skillful decadal predictions and climate projections.
△ Less
Submitted 29 October, 2024; v1 submitted 24 May, 2024;
originally announced May 2024.
-
VAE-Var: Variational-Autoencoder-Enhanced Variational Assimilation
Authors:
Yi Xiao,
Qilong Jia,
Wei Xue,
Lei Bai
Abstract:
Data assimilation refers to a set of algorithms designed to compute the optimal estimate of a system's state by refining the prior prediction (known as background states) using observed data. Variational assimilation methods rely on the maximum likelihood approach to formulate a variational cost, with the optimal state estimate derived by minimizing this cost. Although traditional variational meth…
▽ More
Data assimilation refers to a set of algorithms designed to compute the optimal estimate of a system's state by refining the prior prediction (known as background states) using observed data. Variational assimilation methods rely on the maximum likelihood approach to formulate a variational cost, with the optimal state estimate derived by minimizing this cost. Although traditional variational methods have achieved great success and have been widely used in many numerical weather prediction centers, they generally assume Gaussian errors in the background states, which limits the accuracy of these algorithms due to the inherent inaccuracies of this assumption. In this paper, we introduce VAE-Var, a novel variational algorithm that leverages a variational autoencoder (VAE) to model a non-Gaussian estimate of the background error distribution. We theoretically derive the variational cost under the VAE estimation and present the general formulation of VAE-Var; we implement VAE-Var on low-dimensional chaotic systems and demonstrate through experimental results that VAE-Var consistently outperforms traditional variational assimilation methods in terms of accuracy across various observational settings.
△ Less
Submitted 22 May, 2024;
originally announced May 2024.
-
Electro-optically Modulated Nonlinear Metasurfaces
Authors:
Zhengqing He,
Lun Qu,
Wei Wu,
Jikun Liu,
Jingfei You,
Weiye Liu,
Lu Bai,
Chunyan Jin,
Chenxiong Wang,
Zhidong Gu,
Wei Cai,
Mengxin Ren,
Jingjun Xu
Abstract:
Tunable nonlinearity facilitates the creation of reconfigurable nonlinear metasurfaces, enabling innovative applications in signal processing, light switching, and sensing. This paper presents a novel approach to electrically modulate SHG from a lithium niobate (LN) metasurface, exploiting the electro-optical (EO) effect. By fabricating a nanohole array metasurface on a thin LN film and applying a…
▽ More
Tunable nonlinearity facilitates the creation of reconfigurable nonlinear metasurfaces, enabling innovative applications in signal processing, light switching, and sensing. This paper presents a novel approach to electrically modulate SHG from a lithium niobate (LN) metasurface, exploiting the electro-optical (EO) effect. By fabricating a nanohole array metasurface on a thin LN film and applying an electric field, we demonstrate the alteration of the material's refractive index, resulting in resonance shifts and modulation of SHG intensity at specific wavelengths. Our findings provide valuable insights for the development of electrically tunable nonlinear light sources, quantum optics, dynamic nonlinear holography, and nonlinear information processing.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Multiplane Quantitative Phase Imaging Using a Wavelength-Multiplexed Diffractive Optical Processor
Authors:
Che-Yung Shen,
Jingxi Li,
Tianyi Gan,
Yuhang Li,
Langxing Bai,
Mona Jarrahi,
Aydogan Ozcan
Abstract:
Quantitative phase imaging (QPI) is a label-free technique that provides optical path length information for transparent specimens, finding utility in biology, materials science, and engineering. Here, we present quantitative phase imaging of a 3D stack of phase-only objects using a wavelength-multiplexed diffractive optical processor. Utilizing multiple spatially engineered diffractive layers tra…
▽ More
Quantitative phase imaging (QPI) is a label-free technique that provides optical path length information for transparent specimens, finding utility in biology, materials science, and engineering. Here, we present quantitative phase imaging of a 3D stack of phase-only objects using a wavelength-multiplexed diffractive optical processor. Utilizing multiple spatially engineered diffractive layers trained through deep learning, this diffractive processor can transform the phase distributions of multiple 2D objects at various axial positions into intensity patterns, each encoded at a unique wavelength channel. These wavelength-multiplexed patterns are projected onto a single field-of-view (FOV) at the output plane of the diffractive processor, enabling the capture of quantitative phase distributions of input objects located at different axial planes using an intensity-only image sensor. Based on numerical simulations, we show that our diffractive processor could simultaneously achieve all-optical quantitative phase imaging across several distinct axial planes at the input by scanning the illumination wavelength. A proof-of-concept experiment with a 3D-fabricated diffractive processor further validated our approach, showcasing successful imaging of two distinct phase objects at different axial positions by scanning the illumination wavelength in the terahertz spectrum. Diffractive network-based multiplane QPI designs can open up new avenues for compact on-chip phase imaging and sensing devices.
△ Less
Submitted 16 March, 2024;
originally announced March 2024.
-
Generating Synthetic Computed Tomography for Radiotherapy: SynthRAD2023 Challenge Report
Authors:
Evi M. C. Huijben,
Maarten L. Terpstra,
Arthur Jr. Galapon,
Suraj Pai,
Adrian Thummerer,
Peter Koopmans,
Manya Afonso,
Maureen van Eijnatten,
Oliver Gurney-Champion,
Zeli Chen,
Yiwen Zhang,
Kaiyi Zheng,
Chuanpu Li,
Haowen Pang,
Chuyang Ye,
Runqi Wang,
Tao Song,
Fuxin Fan,
Jingna Qiu,
Yixing Huang,
Juhyung Ha,
Jong Sung Park,
Alexandra Alain-Beaudoin,
Silvain Bériault,
Pengxin Yu
, et al. (34 additional authors not shown)
Abstract:
Radiation therapy plays a crucial role in cancer treatment, necessitating precise delivery of radiation to tumors while sparing healthy tissues over multiple days. Computed tomography (CT) is integral for treatment planning, offering electron density data crucial for accurate dose calculations. However, accurately representing patient anatomy is challenging, especially in adaptive radiotherapy, wh…
▽ More
Radiation therapy plays a crucial role in cancer treatment, necessitating precise delivery of radiation to tumors while sparing healthy tissues over multiple days. Computed tomography (CT) is integral for treatment planning, offering electron density data crucial for accurate dose calculations. However, accurately representing patient anatomy is challenging, especially in adaptive radiotherapy, where CT is not acquired daily. Magnetic resonance imaging (MRI) provides superior soft-tissue contrast. Still, it lacks electron density information while cone beam CT (CBCT) lacks direct electron density calibration and is mainly used for patient positioning. Adopting MRI-only or CBCT-based adaptive radiotherapy eliminates the need for CT planning but presents challenges. Synthetic CT (sCT) generation techniques aim to address these challenges by using image synthesis to bridge the gap between MRI, CBCT, and CT. The SynthRAD2023 challenge was organized to compare synthetic CT generation methods using multi-center ground truth data from 1080 patients, divided into two tasks: 1) MRI-to-CT and 2) CBCT-to-CT. The evaluation included image similarity and dose-based metrics from proton and photon plans. The challenge attracted significant participation, with 617 registrations and 22/17 valid submissions for tasks 1/2. Top-performing teams achieved high structural similarity indices (>0.87/0.90) and gamma pass rates for photon (>98.1%/99.0%) and proton (>97.3%/97.0%) plans. However, no significant correlation was found between image similarity metrics and dose accuracy, emphasizing the need for dose evaluation when assessing the clinical applicability of sCT. SynthRAD2023 facilitated the investigation and benchmarking of sCT generation techniques, providing insights for developing MRI-only and CBCT-based adaptive radiotherapy.
△ Less
Submitted 11 June, 2024; v1 submitted 13 March, 2024;
originally announced March 2024.
-
Global Tropical Cyclone Intensity Forecasting with Multi-modal Multi-scale Causal Autoregressive Model
Authors:
Xinyu Wang,
Kang Chen,
Lei Liu,
Tao Han,
Bin Li,
Lei Bai
Abstract:
Accurate forecasting of Tropical cyclone (TC) intensity is crucial for formulating disaster risk reduction strategies. Current methods predominantly rely on limited spatiotemporal information from ERA5 data and neglect the causal relationships between these physical variables, failing to fully capture the spatial and temporal patterns required for intensity forecasting. To address this issue, we p…
▽ More
Accurate forecasting of Tropical cyclone (TC) intensity is crucial for formulating disaster risk reduction strategies. Current methods predominantly rely on limited spatiotemporal information from ERA5 data and neglect the causal relationships between these physical variables, failing to fully capture the spatial and temporal patterns required for intensity forecasting. To address this issue, we propose a Multi-modal multi-Scale Causal AutoRegressive model (MSCAR), which is the first model that combines causal relationships with large-scale multi-modal data for global TC intensity autoregressive forecasting. Furthermore, given the current absence of a TC dataset that offers a wide range of spatial variables, we present the Satellite and ERA5-based Tropical Cyclone Dataset (SETCD), which stands as the longest and most comprehensive global dataset related to TCs. Experiments on the dataset show that MSCAR outperforms the state-of-the-art methods, achieving maximum reductions in global and regional forecast errors of 9.52% and 6.74%, respectively. The code and dataset are publicly available at https://anonymous.4open.science/r/MSCAR.
△ Less
Submitted 16 February, 2024;
originally announced February 2024.
-
Diffusion Model-based Probabilistic Downscaling for 180-year East Asian Climate Reconstruction
Authors:
Fenghua Ling,
Zeyu Lu,
Jing-Jia Luo,
Lei Bai,
Swadhin K. Behera,
Dachao Jin,
Baoxiang Pan,
Huidong Jiang,
Toshio Yamagata
Abstract:
As our planet is entering into the "global boiling" era, understanding regional climate change becomes imperative. Effective downscaling methods that provide localized insights are crucial for this target. Traditional approaches, including computationally-demanding regional dynamical models or statistical downscaling frameworks, are often susceptible to the influence of downscaling uncertainty. He…
▽ More
As our planet is entering into the "global boiling" era, understanding regional climate change becomes imperative. Effective downscaling methods that provide localized insights are crucial for this target. Traditional approaches, including computationally-demanding regional dynamical models or statistical downscaling frameworks, are often susceptible to the influence of downscaling uncertainty. Here, we address these limitations by introducing a diffusion probabilistic downscaling model (DPDM) into the meteorological field. This model can efficiently transform data from 1° to 0.1° resolution. Compared with deterministic downscaling schemes, it not only has more accurate local details, but also can generate a large number of ensemble members based on probability distribution sampling to evaluate the uncertainty of downscaling. Additionally, we apply the model to generate a 180-year dataset of monthly surface variables in East Asia, offering a more detailed perspective for understanding local scale climate change over the past centuries.
△ Less
Submitted 5 April, 2024; v1 submitted 1 February, 2024;
originally announced February 2024.
-
FengWu-GHR: Learning the Kilometer-scale Medium-range Global Weather Forecasting
Authors:
Tao Han,
Song Guo,
Fenghua Ling,
Kang Chen,
Junchao Gong,
Jingjia Luo,
Junxia Gu,
Kan Dai,
Wanli Ouyang,
Lei Bai
Abstract:
Kilometer-scale modeling of global atmosphere dynamics enables fine-grained weather forecasting and decreases the risk of disastrous weather and climate activity. Therefore, building a kilometer-scale global forecast model is a persistent pursuit in the meteorology domain. Active international efforts have been made in past decades to improve the spatial resolution of numerical weather models. Non…
▽ More
Kilometer-scale modeling of global atmosphere dynamics enables fine-grained weather forecasting and decreases the risk of disastrous weather and climate activity. Therefore, building a kilometer-scale global forecast model is a persistent pursuit in the meteorology domain. Active international efforts have been made in past decades to improve the spatial resolution of numerical weather models. Nonetheless, developing the higher resolution numerical model remains a long-standing challenge due to the substantial consumption of computational resources. Recent advances in data-driven global weather forecasting models utilize reanalysis data for model training and have demonstrated comparable or even higher forecasting skills than numerical models. However, they are all limited by the resolution of reanalysis data and incapable of generating higher-resolution forecasts. This work presents FengWu-GHR, the first data-driven global weather forecasting model running at the 0.09$^{\circ}$ horizontal resolution. FengWu-GHR introduces a novel approach that opens the door for operating ML-based high-resolution forecasts by inheriting prior knowledge from a pretrained low-resolution model. The hindcast of weather prediction in 2022 indicates that FengWu-GHR is superior to the IFS-HRES. Furthermore, evaluations on station observations and case studies of extreme events support the competitive operational forecasting skill of FengWu-GHR at the high resolution.
△ Less
Submitted 28 January, 2024;
originally announced February 2024.
-
Improving Global Weather and Ocean Wave Forecast with Large Artificial Intelligence Models
Authors:
Fenghua Ling,
Lin Ouyang,
Boufeniza Redouane Larbi,
Jing-Jia Luo,
Tao Han,
Xiaohui Zhong,
Lei Bai
Abstract:
The rapid advancement of artificial intelligence technologies, particularly in recent years, has led to the emergence of several large parameter artificial intelligence weather forecast models. These models represent a significant breakthrough, overcoming the limitations of traditional numerical weather prediction models and indicating the emergence of profound potential tools for atmosphere-ocean…
▽ More
The rapid advancement of artificial intelligence technologies, particularly in recent years, has led to the emergence of several large parameter artificial intelligence weather forecast models. These models represent a significant breakthrough, overcoming the limitations of traditional numerical weather prediction models and indicating the emergence of profound potential tools for atmosphere-ocean forecasts. This study explores the evolution of these advanced artificial intelligence forecast models, and based on the identified commonalities, proposes the "Three Large Rules" to measure their development. We discuss the potential of artificial intelligence in revolutionizing numerical weather prediction, and briefly outlining the underlying reasons for its great potential. While acknowledging the high accuracy, computational efficiency, and ease of deployment of large artificial intelligence forecast models, we also emphasize the irreplaceable values of traditional numerical forecasts and explore the challenges in the future development of large-scale artificial intelligence atmosphere-ocean forecast models. We believe that the optimal future of atmosphere-ocean weather forecast lies in achieving a seamless integration of artificial intelligence and traditional numerical models. Such a synthesis is anticipated to offer a more advanced and reliable approach for improved atmosphere-ocean forecasts. Additionally, we illustrate how forecasters can adapt and leverage the advanced artificial intelligence model through an example by building a large artificial intelligence model for global ocean wave forecast.
△ Less
Submitted 18 April, 2024; v1 submitted 29 January, 2024;
originally announced January 2024.
-
Towards an end-to-end artificial intelligence driven global weather forecasting system
Authors:
Kun Chen,
Lei Bai,
Fenghua Ling,
Peng Ye,
Tao Chen,
Jing-Jia Luo,
Hao Chen,
Yi Xiao,
Kang Chen,
Tao Han,
Wanli Ouyang
Abstract:
The weather forecasting system is important for science and society, and significant achievements have been made in applying artificial intelligence (AI) to medium-range weather forecasting. However, existing AI-based weather forecasting models rely on analysis or reanalysis products from traditional numerical weather prediction (NWP) systems as initial conditions for making predictions. Initial s…
▽ More
The weather forecasting system is important for science and society, and significant achievements have been made in applying artificial intelligence (AI) to medium-range weather forecasting. However, existing AI-based weather forecasting models rely on analysis or reanalysis products from traditional numerical weather prediction (NWP) systems as initial conditions for making predictions. Initial states are typically generated by traditional data assimilation components, which are computational expensive and time-consuming. Here we present an AI-based data assimilation model, i.e., Adas, for global weather variables. By introducing the confidence matrix, Adas employs gated convolution to handle sparse observations and gated cross-attention for capturing the interactions between the background and observations. Further, we combine Adas with the advanced AI-based forecasting model (i.e., FengWu) to construct the first end-to-end AI-based global weather forecasting system: FengWu-Adas. We demonstrate that Adas can assimilate global observations to produce high-quality analysis, enabling the system operate stably for long term. Moreover, we are the first to apply the methods to real-world scenarios, which is more challenging and has considerable practical application potential. We have also achieved the forecasts based on the analyses generated by AI with a skillful forecast lead time exceeding that of the IFS for the first time.
△ Less
Submitted 8 April, 2024; v1 submitted 18 December, 2023;
originally announced December 2023.
-
FengWu-4DVar: Coupling the Data-driven Weather Forecasting Model with 4D Variational Assimilation
Authors:
Yi Xiao,
Lei Bai,
Wei Xue,
Kang Chen,
Tao Han,
Wanli Ouyang
Abstract:
Weather forecasting is a crucial yet highly challenging task. With the maturity of Artificial Intelligence (AI), the emergence of data-driven weather forecasting models has opened up a new paradigm for the development of weather forecasting systems. Despite the significant successes that have been achieved (e.g., surpassing advanced traditional physical models for global medium-range forecasting),…
▽ More
Weather forecasting is a crucial yet highly challenging task. With the maturity of Artificial Intelligence (AI), the emergence of data-driven weather forecasting models has opened up a new paradigm for the development of weather forecasting systems. Despite the significant successes that have been achieved (e.g., surpassing advanced traditional physical models for global medium-range forecasting), existing data-driven weather forecasting models still rely on the analysis fields generated by the traditional assimilation and forecasting system, which hampers the significance of data-driven weather forecasting models regarding both computational cost and forecasting accuracy. In this work, we explore the possibility of coupling the data-driven weather forecasting model with data assimilation by integrating the global AI weather forecasting model, FengWu, with one of the most popular assimilation algorithms, Four-Dimensional Variational (4DVar) assimilation, and develop an AI-based cyclic weather forecasting system, FengWu-4DVar. FengWu-4DVar can incorporate observational data into the data-driven weather forecasting model and consider the temporal evolution of atmospheric dynamics to obtain accurate analysis fields for making predictions in a cycling manner without the help of physical models. Owning to the auto-differentiation ability of deep learning models, FengWu-4DVar eliminates the need of developing the cumbersome adjoint model, which is usually required in the traditional implementation of the 4DVar algorithm. Experiments on the simulated observational dataset demonstrate that FengWu-4DVar is capable of generating reasonable analysis fields for making accurate and efficient iterative predictions.
△ Less
Submitted 19 May, 2024; v1 submitted 15 December, 2023;
originally announced December 2023.
-
ResoNet: Robust and Explainable ENSO Forecasts with Hybrid Convolution and Transformer Networks
Authors:
Pumeng Lyu,
Tao Tang,
Fenghua Ling,
Jing-Jia Luo,
Niklas Boers,
Wanli Ouyang,
Lei Bai
Abstract:
Recent studies have shown that deep learning (DL) models can skillfully predict the El Niño-Southern Oscillation (ENSO) forecasts over 1.5 years ahead. However, concerns regarding the reliability of predictions made by DL methods persist, including potential overfitting issues and lack of interpretability. Here, we propose ResoNet, a DL model that combines convolutional neural network (CNN) and Tr…
▽ More
Recent studies have shown that deep learning (DL) models can skillfully predict the El Niño-Southern Oscillation (ENSO) forecasts over 1.5 years ahead. However, concerns regarding the reliability of predictions made by DL methods persist, including potential overfitting issues and lack of interpretability. Here, we propose ResoNet, a DL model that combines convolutional neural network (CNN) and Transformer architectures. This hybrid architecture design enables our model to adequately capture local SSTA as well as long-range inter-basin interactions across oceans. We show that ResoNet can robustly predict ESNO at lead times between 19 and 26 months, thus outperforming existing approaches in terms of the forecast horizon. According to an explainability method applied to ResoNet predictions of El Niño and La Niña events from 1- to 18-month lead, we find that it predicts the Niño3.4 index based on multiple physically reasonable mechanisms, such as the Recharge Oscillator concept, Seasonal Footprint Mechanism, and Indian Ocean capacitor effect. Moreover, we demonstrate that for the first time, the asymmetry between El Niño and La Niña development can be captured by ResoNet. Our results could help alleviate skepticism about applying DL models for ENSO prediction and encourage more attempts to discover and predict climate phenomena using AI methods.
△ Less
Submitted 16 December, 2023;
originally announced December 2023.
-
Geometry-enhanced Pre-training on Interatomic Potentials
Authors:
Taoyong Cui,
Chenyu Tang,
Mao Su,
Shufei Zhang,
Yuqiang Li,
Lei Bai,
Yuhan Dong,
Xingao Gong,
Wanli Ouyang
Abstract:
Machine learning interatomic potentials (MLIPs) enables molecular dynamics (MD) simulations with ab initio accuracy and has been applied to various fields of physical science. However, the performance and transferability of MLIPs are limited by insufficient labeled training data, which require expensive ab initio calculations to obtain the labels, especially for complex molecular systems. To addre…
▽ More
Machine learning interatomic potentials (MLIPs) enables molecular dynamics (MD) simulations with ab initio accuracy and has been applied to various fields of physical science. However, the performance and transferability of MLIPs are limited by insufficient labeled training data, which require expensive ab initio calculations to obtain the labels, especially for complex molecular systems. To address this challenge, we design a novel geometric structure learning paradigm that consists of two stages. We first generate a large quantity of 3D configurations of target molecular system with classical molecular dynamics simulations. Then, we propose geometry-enhanced self-supervised learning consisting of masking, denoising, and contrastive learning to better capture the topology and 3D geometric information from the unlabeled 3D configurations. We evaluate our method on various benchmarks ranging from small molecule datasets to complex periodic molecular systems with more types of elements. The experimental results show that the proposed pre-training method can greatly enhance the accuracy of MLIPs with few extra computational costs and works well with different invariant or equivariant graph neural network architectures. Our method improves the generalization capability of MLIPs and helps to realize accurate MD simulations for complex molecular systems.
△ Less
Submitted 12 April, 2024; v1 submitted 27 September, 2023;
originally announced September 2023.
-
FengWu: Pushing the Skillful Global Medium-range Weather Forecast beyond 10 Days Lead
Authors:
Kang Chen,
Tao Han,
Junchao Gong,
Lei Bai,
Fenghua Ling,
Jing-Jia Luo,
Xi Chen,
Leiming Ma,
Tianning Zhang,
Rui Su,
Yuanzheng Ci,
Bin Li,
Xiaokang Yang,
Wanli Ouyang
Abstract:
We present FengWu, an advanced data-driven global medium-range weather forecast system based on Artificial Intelligence (AI). Different from existing data-driven weather forecast methods, FengWu solves the medium-range forecast problem from a multi-modal and multi-task perspective. Specifically, a deep learning architecture equipped with model-specific encoder-decoders and cross-modal fusion Trans…
▽ More
We present FengWu, an advanced data-driven global medium-range weather forecast system based on Artificial Intelligence (AI). Different from existing data-driven weather forecast methods, FengWu solves the medium-range forecast problem from a multi-modal and multi-task perspective. Specifically, a deep learning architecture equipped with model-specific encoder-decoders and cross-modal fusion Transformer is elaborately designed, which is learned under the supervision of an uncertainty loss to balance the optimization of different predictors in a region-adaptive manner. Besides this, a replay buffer mechanism is introduced to improve medium-range forecast performance. With 39-year data training based on the ERA5 reanalysis, FengWu is able to accurately reproduce the atmospheric dynamics and predict the future land and atmosphere states at 37 vertical levels on a 0.25° latitude-longitude resolution. Hindcasts of 6-hourly weather in 2018 based on ERA5 demonstrate that FengWu performs better than GraphCast in predicting 80\% of the 880 reported predictands, e.g., reducing the root mean square error (RMSE) of 10-day lead global z500 prediction from 733 to 651 $m^{2}/s^2$. In addition, the inference cost of each iteration is merely 600ms on NVIDIA Tesla A100 hardware. The results suggest that FengWu can significantly improve the forecast skill and extend the skillful global medium-range weather forecast out to 10.75 days lead (with ACC of z500 > 0.6) for the first time.
△ Less
Submitted 6 April, 2023;
originally announced April 2023.
-
Acceleration of 60 MeV proton beams in the commissioning experiment of SULF-10 PW laser
Authors:
A. X. Li,
C. Y. Qin,
H. Zhang,
S. Li,
L. L. Fan,
Q. S. Wang,
T. J. Xu,
N. W. Wang,
L. H. Yu,
Y. Xu,
Y. Q. Liu,
C. Wang,
X. L. Wang,
Z. X. Zhang,
X. Y. Liu,
P. L. Bai,
Z. B. Gan,
X. B. Zhang,
X. B. Wang,
C. Fan,
Y. J. Sun,
Y. H. Tang,
B. Yao,
X. Y. Liang,
Y. X. Leng
, et al. (3 additional authors not shown)
Abstract:
We report the experimental results of the commissioning phase in the 10 PW laser beamline of Shanghai Superintense Ultrafast Laser Facility (SULF). The peak power reaches 2.4 PW on target without the last amplifying during the experiment. The laser energy of 72\pm 9 J is directed to a focal spot of ~6 μm diameter (FWHM) in 30 fs pulse duration, yielding a focused peak intensity around 2.0 \times 1…
▽ More
We report the experimental results of the commissioning phase in the 10 PW laser beamline of Shanghai Superintense Ultrafast Laser Facility (SULF). The peak power reaches 2.4 PW on target without the last amplifying during the experiment. The laser energy of 72\pm 9 J is directed to a focal spot of ~6 μm diameter (FWHM) in 30 fs pulse duration, yielding a focused peak intensity around 2.0 \times 10^{21} W/cm^2. First laser-proton acceleration experiment is performed using plain copper and plastic targets. High-energy proton beams with maximum cut-off energy up to 62.5 MeV are achieved using copper foils at the optimum target thickness of 4 μm via target normal sheath acceleration (TNSA). For plastic targets of tens of nanometers thick, the proton cut-off energy is approximately 20 MeV, showing ring-like or filamented density distributions. These experimental results reflect the capabilities of the SULF-10 PW beamline, e.g., both ultrahigh intensity and relatively good beam contrast. Further optimization for these key parameters is underway, where peak laser intensities of 10^{22}-10^{23} W/cm^2 are anticipated to support various experiments on extreme field physics.
△ Less
Submitted 14 July, 2022;
originally announced July 2022.
-
Quantum-enhanced rubidium atomic magnetometer based on Faraday rotation via 795-nm Stokes operator squeezed light
Authors:
Lele Bai,
Xin Wen,
Yulin Yang,
Lulu Zhang,
Jun He,
Yanhua Wang,
Junmin Wang
Abstract:
With the help of Stokes operator S2 squeezed state (also called polarization squeezed state (PSS)) of 795-nm light, rubidium-87 (87Rb) atomic magnetometer based on Faraday rotation has been implemented and characterized.The PSS of Stokes operator S2 of 795-nm light has been prepared by means of coherently combining the polarization coherent state (PCS) of a linearly p-polarized bright 795-nm light…
▽ More
With the help of Stokes operator S2 squeezed state (also called polarization squeezed state (PSS)) of 795-nm light, rubidium-87 (87Rb) atomic magnetometer based on Faraday rotation has been implemented and characterized.The PSS of Stokes operator S2 of 795-nm light has been prepared by means of coherently combining the polarization coherent state (PCS) of a linearly p-polarized bright 795-nm light beam and a linearly s-polarized squeezed vacuum state (SVS) generated by a 397.5-nm ultraviolet laser pumped sub-threshold optical parametric oscillator (OPO) with a PPKTP bulk crystal inside the OPO cavity.PSS with a squeezing level of -3.7 has been achieved around the analysis frequency of 10 kHz. At different transitions of D1 line, various frequency detuning, and reasonable atomic vapor cells temperature, Faraday rotation has been measured and compared.To decrease absorption (scattering) losses and the back-action from atomic spin noise to the probe beams polarization noise for maintaining the quantum properties of PSS of Stokes operator S2 of 795-nm light, we had to run our magnetometer with 87Rb vapor cells temperature below 60, at which the PSS was almost destroyed.The sensitivities of magnetic field measurement were characterized via measuring signal-to-noise ratio of the alternating current (AC) calibrated magnetic field signal with a balanced polarimeter. Under the conditions of the atomic number density of 5.8*1010 /cm3 and the probe beam with a detuning of - 400 MHz relative to the 5S1/2 (Fg=2) - 5P1/2 (Fe=1) transition of 87Rb D1 line, a typical sensitivity of 19.5 pT/Hz1/2 has been achieved employing PSS of Stokes operator S2 as the probe, compared with a sensitivity of 28.3 pT/Hz1/2 using PCS as the probe.We preliminarily demonstrated that the quantum-enhanced sensitivity in a Faraday-rotation-based 87Rb atomic magnetometer with the help of PSS of 795-nm light.
△ Less
Submitted 5 December, 2021; v1 submitted 30 November, 2021;
originally announced December 2021.
-
Enhancement of spin noise spectroscopy of rubidium atomic ensemble by using of the polarization squeezed light
Authors:
Lele Bai,
Lulu Zhang,
Yongbiao Yang,
Rui Chang,
Yao Qin,
Jun He,
Xin Wen,
Junmin Wang
Abstract:
We measured the spin noise spectroscopy (SNS) of rubidium atomic ensemble with two different atomic vapor cells (filled with the buffer gases or coated with paraffin film on the inner wall), and demonstrated the enhancement of signal to noise ratio (SNR) by using of the polarization squeezed state (PSS) of 795 nm light field with Stokes operator S2 squeezed. PSS is prepared by locking the relative…
▽ More
We measured the spin noise spectroscopy (SNS) of rubidium atomic ensemble with two different atomic vapor cells (filled with the buffer gases or coated with paraffin film on the inner wall), and demonstrated the enhancement of signal to noise ratio (SNR) by using of the polarization squeezed state (PSS) of 795 nm light field with Stokes operator S2 squeezed. PSS is prepared by locking the relative phase between the squeezed vacuum state of light obtained by a sub-threshold optical parametric oscillator and the orthogonal polarized local oscillator beam by means of the quantum noise lock. Under the same conditions, PSS can be employed not only to improve SNR, but also to keep the full width at half maximum (FWHM) of SNS unchanged, compared with the case of using polarization coherent state (PCS), and the enhancement of SNR is positively correlated with the squeezing level of PSS. With the increase of probe laser power and atomic number density, the SNR and FWHM of SNS will increase correspondingly. With the help of PSS of Stokes operator S2, quantum enhancement of both SNR and FWHM of SNS signal has been demonstrated by controlling optical power of the S2 polarization squeezed light beam or atomic number density in our experiments.
△ Less
Submitted 18 November, 2021;
originally announced November 2021.
-
Accurate force field of two-dimensional ferroelectrics from deep learning
Authors:
Jing Wu,
Liyi Bai,
Jiawei Huang,
Liyang Ma,
Jian Liu,
Shi Liu
Abstract:
The discovery of two-dimensional (2D) ferroelectrics with switchable out-of-plane polarization such as monolayer $α$-In$_2$Se$_3$ offers a new avenue for ultrathin high-density ferroelectric-based nanoelectronics such as ferroelectric field effect transistors and memristors. The functionality of ferroelectrics depends critically on the dynamics of polarization switching in response to an external…
▽ More
The discovery of two-dimensional (2D) ferroelectrics with switchable out-of-plane polarization such as monolayer $α$-In$_2$Se$_3$ offers a new avenue for ultrathin high-density ferroelectric-based nanoelectronics such as ferroelectric field effect transistors and memristors. The functionality of ferroelectrics depends critically on the dynamics of polarization switching in response to an external electric/stress field. Unlike the switching dynamics in bulk ferroelectrics that have been extensively studied, the mechanisms and dynamics of polarization switching in 2D remain largely unexplored. Molecular dynamics (MD) using classical force fields is a reliable and efficient method for large-scale simulations of dynamical processes with atomic resolution. Here we developed a deep neural network-based force field of monolayer In$_2$Se$_3$ using a concurrent learning procedure that efficiently updates the first-principles-based training database. The model potential has accuracy comparable with density functional theory (DFT), capable of predicting a range of thermodynamic properties of In$_2$Se$_3$ polymorphs and lattice dynamics of ferroelectric In$_2$Se$_3$. Pertinent to the switching dynamics, the model potential also reproduces the DFT kinetic pathways of polarization reversal and 180$^\circ$ domain wall motions. Moreover, isobaric-isothermal ensemble MD simulations predict a temperature-driven $α\rightarrow β$ phase transition at the single-layer limit, as revealed by both local atomic displacement and Steinhardt's bond orientational order parameter $Q_4$. Our work paves the way for further research on the dynamics of ferroelectric $α$-In$_2$Se$_3$ and related systems.
△ Less
Submitted 29 October, 2021; v1 submitted 15 September, 2021;
originally announced September 2021.
-
Dual camera snapshot hyperspectral imaging system via physics informed learning
Authors:
Hui Xie,
Zhuang Zhao,
Jing Han,
Yi Zhang,
Lianfa Bai,
Jun Lu
Abstract:
We consider using the system's optical imaging process with convolutional neural networks (CNNs) to solve the snapshot hyperspectral imaging reconstruction problem, which uses a dual-camera system to capture the three-dimensional hyperspectral images (HSIs) in a compressed way. Various methods using CNNs have been developed in recent years to reconstruct HSIs, but most of the supervised deep learn…
▽ More
We consider using the system's optical imaging process with convolutional neural networks (CNNs) to solve the snapshot hyperspectral imaging reconstruction problem, which uses a dual-camera system to capture the three-dimensional hyperspectral images (HSIs) in a compressed way. Various methods using CNNs have been developed in recent years to reconstruct HSIs, but most of the supervised deep learning methods aimed to fit a brute-force mapping relationship between the captured compressed image and standard HSIs. Thus, the learned mapping would be invalid when the observation data deviate from the training data. Especially, we usually don't have ground truth in real-life scenarios. In this paper, we present a self-supervised dual-camera equipment with an untrained physics-informed CNNs framework. Extensive simulation and experimental results show that our method without training can be adapted to a wide imaging environment with good performance. Furthermore, compared with the training-based methods, our system can be constantly fine-tuned and self-improved in real-life scenarios.
△ Less
Submitted 17 November, 2021; v1 submitted 6 September, 2021;
originally announced September 2021.
-
Construction and On-site Performance of the LHAASO WFCTA Camera
Authors:
F. Aharonian,
Q. An,
Axikegu,
L. X. Bai,
Y. X. Bai,
Y. W. Bao,
D. Bastieri,
X. J. Bi,
Y. J. Bi,
H. Cai,
J. T. Cai,
Z. Cao,
Z. Cao,
J. Chang,
J. F. Chang,
X. C. Chang,
B. M. Chen,
J. Chen,
L. Chen,
L. Chen,
L. Chen,
M. J. Chen,
M. L. Chen,
Q. H. Chen,
S. H. Chen
, et al. (234 additional authors not shown)
Abstract:
The focal plane camera is the core component of the Wide Field-of-view Cherenkov/fluorescence Telescope Array (WFCTA) of the Large High-Altitude Air Shower Observatory (LHAASO). Because of the capability of working under moonlight without aging, silicon photomultipliers (SiPM) have been proven to be not only an alternative but also an improvement to conventional photomultiplier tubes (PMT) in this…
▽ More
The focal plane camera is the core component of the Wide Field-of-view Cherenkov/fluorescence Telescope Array (WFCTA) of the Large High-Altitude Air Shower Observatory (LHAASO). Because of the capability of working under moonlight without aging, silicon photomultipliers (SiPM) have been proven to be not only an alternative but also an improvement to conventional photomultiplier tubes (PMT) in this application. Eighteen SiPM-based cameras with square light funnels have been built for WFCTA. The telescopes have collected more than 100 million cosmic ray events and preliminary results indicate that these cameras are capable of working under moonlight. The characteristics of the light funnels and SiPMs pose challenges (e.g. dynamic range, dark count rate, assembly techniques). In this paper, we present the design features, manufacturing techniques and performances of these cameras. Finally, the test facilities, the test methods and results of SiPMs in the cameras are reported here.
△ Less
Submitted 4 July, 2021; v1 submitted 29 December, 2020;
originally announced December 2020.
-
Laser Intensity Noise Suppression for Preparing Audio-Frequency 795 nm Squeezed Vacuum State of Light at Rubidium D1 Line
Authors:
Lele Bai,
Xin Wen,
Yulin Yang,
Jun He,
Junmin Wang
Abstract:
Laser intensity noise suppression has essential effects on preparation and characterization of the audio-frequency squeezed vacuum state of light based on a sub-threshold optical parametric oscillator (OPO).We have implemented two feedback loops by using relevant acousto-optical modulators (AOM) to stabilize the intensity of 795-nm near infrared (NIR) fundamental laser and 397.5-nm ultraviolet (UV…
▽ More
Laser intensity noise suppression has essential effects on preparation and characterization of the audio-frequency squeezed vacuum state of light based on a sub-threshold optical parametric oscillator (OPO).We have implemented two feedback loops by using relevant acousto-optical modulators (AOM) to stabilize the intensity of 795-nm near infrared (NIR) fundamental laser and 397.5-nm ultraviolet (UV) laser generated by cavity-enhanced frequency doubling.Typical peak-to-peak laser intensity fluctuation with a bandwidth of $\sim10$ kHz in a half hour has been improved from $\pm7.45$$\%$ to $\pm0.06$$\%$ for 795-nm NIR laser beam, and from $\pm9.04$$\%$ to $\pm0.05$$\%$ for 397.5-nm UV laser beam, respectively. The squeezing level of the squeezed vacuum state at 795 nm prepared by the sub-threshold OPO with a PPKTP crystal has been improved from -3.3 to -4.0 dB around 3$\sim$9 kHz of audio analysis frequency range.
△ Less
Submitted 10 March, 2020;
originally announced March 2020.
-
Synchronous locating and imaging behind scattering medium in a large depth based on deep learning
Authors:
Shuo Zhu,
Enlai Guo,
Qianying Cui,
Dongliang Zheng,
Lianfa Bai,
Jing Han
Abstract:
Scattering medium brings great difficulties to locate and image planar objects especially when the object has a large depth. In this letter, a novel learning-based method is presented to locate and image the object hidden behind a thin scattering diffuser. A multi-task network, named DINet, is constructed to predict the depth and the image of the hidden object from the captured speckle patterns. T…
▽ More
Scattering medium brings great difficulties to locate and image planar objects especially when the object has a large depth. In this letter, a novel learning-based method is presented to locate and image the object hidden behind a thin scattering diffuser. A multi-task network, named DINet, is constructed to predict the depth and the image of the hidden object from the captured speckle patterns. The provided experiments verify that the proposed method enables to locate the object with a depth mean error less than 0.05 mm, and image the object with an average PSNR above 24 dB, in a large depth ranging from 350 mm to 1150 mm. The constructed DINet can obtain multiple physical information via a single speckle pattern, including both the depth and image. Comparing with the traditional methods, it paves the way to the practical applications requiring large imaging depth of field behind scattering media.
△ Less
Submitted 29 May, 2020; v1 submitted 25 October, 2019;
originally announced October 2019.
-
Learning-based real-time method to looking through scattering medium beyond the memory effect
Authors:
Enlai Guo,
Shuo Zhu,
Yan Sun,
Lianfa Bai,
Jing Han
Abstract:
Strong scattering medium brings great difficulties to optical imaging, which is also a problem in medical imaging and many other fields. Optical memory effect makes it possible to image through strong random scattering medium. However, this method also has the limitation of limited angle field-of-view (FOV), which prevents it from being applied in practice. In this paper, a kind of practical convo…
▽ More
Strong scattering medium brings great difficulties to optical imaging, which is also a problem in medical imaging and many other fields. Optical memory effect makes it possible to image through strong random scattering medium. However, this method also has the limitation of limited angle field-of-view (FOV), which prevents it from being applied in practice. In this paper, a kind of practical convolutional neural network called PDSNet is proposed, which effectively breaks through the limitation of optical memory effect on FOV. Experiments is conducted to prove that the scattered pattern can be reconstructed accurately in real-time by PDSNet, and it is widely applicable to retrieve complex objects of random scales and different scattering media.
△ Less
Submitted 4 November, 2019; v1 submitted 19 October, 2019;
originally announced October 2019.
-
High-SNR snapshot multiplex spectrometer with sub-Hadamard-S matrix coding
Authors:
Zhuang Zhao,
Lianfa Bai,
Jing Han,
Jiang Yue
Abstract:
We present a robust high signal-to-noise ratio (SNR) snapshot multiplex spectrometer with sub-Hadamard-S matrix coding. We demonstrated for the first time that the sub-Hadamard-S matrix coding could provide comparable SNR improvement with Hadamard-S matrix in Hadamard transform spectrometer (HTS). Normally, HTS should change the coding mask to obtain a reasonable spectrum result, causing unexpecte…
▽ More
We present a robust high signal-to-noise ratio (SNR) snapshot multiplex spectrometer with sub-Hadamard-S matrix coding. We demonstrated for the first time that the sub-Hadamard-S matrix coding could provide comparable SNR improvement with Hadamard-S matrix in Hadamard transform spectrometer (HTS). Normally, HTS should change the coding mask to obtain a reasonable spectrum result, causing unexpected time-consuming. An extra imaging path to collect the light intensity of the aperture is added in this paper. Both light intensity of the aperture and overlapped spectra are captured within one shot, turning Hadamard-S matrix coding into sub-Hadamard-S matrix coding. Simulations and experiments show that the proposed method could obtain comparable SNR improvement with the traditional HTS, maintaining snapshot.
△ Less
Submitted 18 May, 2019;
originally announced May 2019.
-
Indirect Coupling between Two Cavity Photon Systems via Ferromagnetic Resonance
Authors:
Paul Hyde,
Lihui Bai,
Michael Harder,
Christophe Match,
Can-Ming Hu
Abstract:
We experimentally realize indirect coupling between two cavity modes via strong coupling with the ferromagnetic resonance in Yttrium Iron Garnet (YIG). We find that some indirectly coupled modes of our system can have a higher microwave transmission than the individual uncoupled modes. Using a coupled harmonic oscillator model, the influence of the oscillation phase difference between the two cavi…
▽ More
We experimentally realize indirect coupling between two cavity modes via strong coupling with the ferromagnetic resonance in Yttrium Iron Garnet (YIG). We find that some indirectly coupled modes of our system can have a higher microwave transmission than the individual uncoupled modes. Using a coupled harmonic oscillator model, the influence of the oscillation phase difference between the two cavity modes on the nature of the indirect coupling is revealed. These indirectly coupled microwave modes can be controlled using an external magnetic field or by tuning the cavity height. This work has potential for use in controllable optical devices and information processing technologies.
△ Less
Submitted 10 June, 2016;
originally announced June 2016.
-
One laser pulse generates two photoacoustic signals
Authors:
Fei Gao,
Xiaohua Feng,
Linyi Bai,
Ruochong Zhang,
Siyu Liu,
Ran Ding,
Rahul Kishor,
Yanli Zhao,
Yuanjin Zheng
Abstract:
Photoacoustic sensing and imaging techniques have been studied widely to explore optical absorption contrast based on nanosecond laser illumination. In this paper, we report a long laser pulse induced dual photoacoustic (LDPA) nonlinear effect, which originates from unsatisfied stress and thermal confinements. Being different from conventional short laser pulse illumination, the proposed method ut…
▽ More
Photoacoustic sensing and imaging techniques have been studied widely to explore optical absorption contrast based on nanosecond laser illumination. In this paper, we report a long laser pulse induced dual photoacoustic (LDPA) nonlinear effect, which originates from unsatisfied stress and thermal confinements. Being different from conventional short laser pulse illumination, the proposed method utilizes a long square-profile laser pulse to induce dual photoacoustic signals. Without satisfying the stress confinement, the dual photoacoustic signals are generated following the positive and negative edges of the long laser pulse. More interestingly, the first expansion-induced photoacoustic signal exhibits positive waveform due to the initial sharp rising of temperature. On the contrary, the second contraction-induced photoacoustic signal exhibits exactly negative waveform due to the falling of temperature, as well as pulse-width-dependent signal amplitude which is caused by the concurrent heat accumulation and thermal diffusion during the long laser illumination. An analytical model is derived to describe the generation of the dual photoacoustic pulses, incorporating Gruneisen saturation and thermal diffusion effect, which is experimentally proved. Lastly, an alternate of LDPA technique using quasi-CW laser excitation is also introduced and demonstrated for both super-contrast in vitro and in vivo imaging. Compared with existing nonlinear PA techniques, the proposed LDPA nonlinear effect could enable a much broader range of potential applications.
△ Less
Submitted 18 May, 2016; v1 submitted 25 February, 2016;
originally announced February 2016.