-
STONet: A novel neural operator for modeling solute transport in micro-cracked reservoirs
Authors:
Ehsan Haghighat,
Mohammad Hesan Adeli,
S Mohammad Mousavi,
Ruben Juanes
Abstract:
In this work, we develop a novel neural operator, the Solute Transport Operator Network (STONet), to efficiently model contaminant transport in micro-cracked reservoirs. The model combines different networks to encode heterogeneous properties effectively. By predicting the concentration rate, we are able to accurately model the transport process. Numerical experiments demonstrate that our neural o…
▽ More
In this work, we develop a novel neural operator, the Solute Transport Operator Network (STONet), to efficiently model contaminant transport in micro-cracked reservoirs. The model combines different networks to encode heterogeneous properties effectively. By predicting the concentration rate, we are able to accurately model the transport process. Numerical experiments demonstrate that our neural operator approach achieves accuracy comparable to that of the finite element method. The previously introduced Enriched DeepONet architecture has been revised, motivated by the architecture of the popular multi-head attention of transformers, to improve its performance without increasing the compute cost. The computational efficiency of the proposed model enables rapid and accurate predictions of solute transport, facilitating the optimization of reservoir management strategies and the assessment of environmental impacts. The data and code for the paper will be published at https://github.com/ehsanhaghighat/STONet.
△ Less
Submitted 7 December, 2024;
originally announced December 2024.
-
Gemini & Physical World: Large Language Models Can Estimate the Intensity of Earthquake Shaking from Multi-Modal Social Media Posts
Authors:
S. Mostafa Mousavi,
Marc Stogaitis,
Tajinder Gadh,
Richard M Allen,
Alexei Barski,
Robert Bosch,
Patrick Robertson,
Nivetha Thiruverahan,
Youngmin Cho,
Aman Raj
Abstract:
This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified…
▽ More
This paper presents a novel approach to extract scientifically valuable information about Earth's physical phenomena from unconventional sources, such as multi-modal social media posts. Employing a state-of-the-art large language model (LLM), Gemini 1.5 Pro (Reid et al. 2024), we estimate earthquake ground shaking intensity from these unstructured posts. The model's output, in the form of Modified Mercalli Intensity (MMI) values, aligns well with independent observational data. Furthermore, our results suggest that LLMs, trained on vast internet data, may have developed a unique understanding of physical phenomena. Specifically, Google's Gemini models demonstrate a simplified understanding of the general relationship between earthquake magnitude, distance, and MMI intensity, accurately describing observational data even though it's not identical to established models. These findings raise intriguing questions about the extent to which Gemini's training has led to a broader understanding of the physical world and its phenomena. The ability of Generative AI models like Gemini to generate results consistent with established scientific knowledge highlights their potential to augment our understanding of complex physical phenomena like earthquakes. The flexible and effective approach proposed in this study holds immense potential for enriching our understanding of the impact of physical phenomena and improving resilience during natural disasters. This research is a significant step toward harnessing the power of social media and AI for natural disaster mitigation, opening new avenues for understanding the emerging capabilities of Generative AI and LLMs for scientific applications.
△ Less
Submitted 14 June, 2024; v1 submitted 28 May, 2024;
originally announced May 2024.
-
QuakeFlow: A Scalable Machine-learning-based Earthquake Monitoring Workflow with Cloud Computing
Authors:
Weiqiang Zhu,
Alvin Brian Hou,
Robert Yang,
Avoy Datta,
S. Mostafa Mousavi,
William L. Ellsworth,
Gregory C. Beroza
Abstract:
Earthquake monitoring workflows are designed to detect earthquake signals and to determine source characteristics from continuous waveform data. Recent developments in deep learning seismology have been used to improve tasks within earthquake monitoring workflows that allow the fast and accurate detection of up to orders of magnitude more small events than are present in conventional catalogs. To…
▽ More
Earthquake monitoring workflows are designed to detect earthquake signals and to determine source characteristics from continuous waveform data. Recent developments in deep learning seismology have been used to improve tasks within earthquake monitoring workflows that allow the fast and accurate detection of up to orders of magnitude more small events than are present in conventional catalogs. To facilitate the application of machine-learning algorithms to large-volume seismic records, we developed a cloud-based earthquake monitoring workflow, QuakeFlow, that applies multiple processing steps to generate earthquake catalogs from raw seismic data. QuakeFlow uses a deep learning model, PhaseNet, for picking P/S phases and a machine learning model, GaMMA, for phase association with approximate earthquake location and magnitude. Each component in QuakeFlow is containerized, allowing straightforward updates to the pipeline with new deep learning/machine learning models, as well as the ability to add new components, such as earthquake relocation algorithms. We built QuakeFlow in Kubernetes to make it auto-scale for large datasets and to make it easy to deploy on cloud platforms, which enables large-scale parallel processing. We used QuakeFlow to process three years of continuous archived data from Puerto Rico, and found more than a factor of ten more events that occurred on much the same structures as previously known seismicity. We applied Quakeflow to monitoring frequent earthquakes in Hawaii and found over an order of magnitude more events than are in the standard catalog, including many events that illuminate the deep structure of the magmatic system. We also added Kafka and Spark streaming to deliver real-time earthquake monitoring results. QuakeFlow is an effective and efficient approach both for improving realtime earthquake monitoring and for mining archived seismic data sets.
△ Less
Submitted 30 August, 2022;
originally announced August 2022.
-
An End-to-End Earthquake Detection Method for Joint Phase Picking and Association using Deep Learning
Authors:
Weiqiang Zhu,
Kai Sheng Tai,
S. Mostafa Mousavi,
Peter Bailis,
Gregory C. Beroza
Abstract:
Earthquake monitoring by seismic networks typically involves a workflow consisting of phase detection/picking, association, and location tasks. In recent years, the accuracy of these individual stages has been improved through the use of machine learning techniques. In this study, we introduce a new, end-to-end approach that improves overall earthquake detection accuracy by jointly optimizing each…
▽ More
Earthquake monitoring by seismic networks typically involves a workflow consisting of phase detection/picking, association, and location tasks. In recent years, the accuracy of these individual stages has been improved through the use of machine learning techniques. In this study, we introduce a new, end-to-end approach that improves overall earthquake detection accuracy by jointly optimizing each stage of the detection pipeline. We propose a neural network architecture for the task of multi-station processing of seismic waveforms recorded over a seismic network. This end-to-end architecture consists of three sub-networks: a backbone network that extracts features from raw waveforms, a phase picking sub-network that picks P- and S-wave arrivals based on these features, and an event detection sub-network that aggregates the features from multiple stations and detects earthquakes. We use these sub-networks in conjunction with a shift-and-stack module based on back-projection that introduces kinematic constraints on arrival times, allowing the model to generalize to different velocity models and to variable station geometry in seismic networks. We evaluate our proposed method on the STanford EArthquake Dataset (STEAD) and on the 2019 Ridgecrest, CA earthquake sequence. The results demonstrate that our end-to-end approach can effectively pick P- and S-wave arrivals and achieve earthquake detection accuracy rivaling that of other state-of-the-art approaches.
△ Less
Submitted 20 September, 2021;
originally announced September 2021.
-
Earthquake Phase Association using a Bayesian Gaussian Mixture Model
Authors:
Weiqiang Zhu,
Ian W. McBrearty,
S. Mostafa Mousavi,
William L. Ellsworth,
Gregory C. Beroza
Abstract:
Earthquake phase association algorithms aggregate picked seismic phases from a network of seismometers into individual earthquakes and play an important role in earthquake monitoring. Dense seismic networks and improved phase picking methods produce massive earthquake phase data sets, particularly for earthquake swarms and aftershocks occurring closely in time and space, making phase association a…
▽ More
Earthquake phase association algorithms aggregate picked seismic phases from a network of seismometers into individual earthquakes and play an important role in earthquake monitoring. Dense seismic networks and improved phase picking methods produce massive earthquake phase data sets, particularly for earthquake swarms and aftershocks occurring closely in time and space, making phase association a challenging problem. We present a new association method, the Gaussian Mixture Model Association (GaMMA), that combines the Gaussian mixture model for phase measurements (both time and amplitude), with earthquake location, origin time, and magnitude estimation. We treat earthquake phase association as an unsupervised clustering problem in a probabilistic framework, where each earthquake corresponds to a cluster of P and S phases with hyperbolic moveout of arrival times and a decay of amplitude with distance. We use a multivariate Gaussian distribution to model the collection of phase picks for an event, the mean of which is given by the predicted arrival time and amplitude from the causative event. We carry out the pick assignment for each earthquake and determine earthquake parameters (i.e., earthquake location, origin time, and magnitude) under the maximum likelihood criterion using the Expectation-Maximization (EM) algorithm. The GaMMA method does not require the typical association steps of other algorithms, such as grid-search or supervised training. The results on both synthetic test and the 2019 Ridgecrest earthquake sequence show that GaMMA effectively associates phases from a temporally and spatially dense earthquake sequence while producing useful estimates of earthquake location and magnitude.
△ Less
Submitted 18 September, 2021;
originally announced September 2021.
-
Dark Matter Effects on Stellar Populations in Globular Clusters
Authors:
Ebrahim Hassani,
Seyyed Milad Ghaffarpour Mousavi
Abstract:
According to the classical view of globular clusters, stars inside globular clusters are evolved from the same giant molecular cloud. Then their stars' chemical compositions must be the same. But recent photometric and spectroscopic studies of globular clusters reveal the presence of more-than-one stellar populations inside globular clusters. This finding challenges our classical view of globular…
▽ More
According to the classical view of globular clusters, stars inside globular clusters are evolved from the same giant molecular cloud. Then their stars' chemical compositions must be the same. But recent photometric and spectroscopic studies of globular clusters reveal the presence of more-than-one stellar populations inside globular clusters. This finding challenges our classical view of globular clusters.
In this work, we investigated the possibility of solving multiple stellar populations problem in globular clusters using dark matter assumptions. We showed that the presence of dark matter inside globular clusters changes the physical parameters (e.g. chemical composition, luminosity, temperature, age, etc.) of stars inside them. We supposed that dark matter distributed non-uniformly inside globular clusters. It means stars in high dark matter density environments (like the central region of globular clusters) are more affected by the presence of dark matter. Using this assumption, we showed that stars in different locations of globular clusters (corresponding to different dark matter densities) follow different evolutionary paths (e.g. on Hertzsprung-Russell diagram). We used this note to infer that the presence of dark matter inside globular clusters can be the reason for the multiple stellar populations.
△ Less
Submitted 28 April, 2022; v1 submitted 7 June, 2021;
originally announced June 2021.
-
Wall entrapment of peritrichous bacteria: A mesoscale hydrodynamics simulation study
Authors:
S. Mahdiyeh Mousavi,
Gerhard Gompper,
Roland G. Winkler
Abstract:
Microswimmers such as E. Coli bacteria accumulate and exhibit an intriguing dynamics near walls, governed by hydrodynamic and steric interactions. Insight into the underlying mechanisms and predominant interactions demand a detailed characterization of the entrapment process. We employ a mesoscale hydrodynamics simulation approach to study entrapment of a E. coli-type cell at a no-slip wall. The c…
▽ More
Microswimmers such as E. Coli bacteria accumulate and exhibit an intriguing dynamics near walls, governed by hydrodynamic and steric interactions. Insight into the underlying mechanisms and predominant interactions demand a detailed characterization of the entrapment process. We employ a mesoscale hydrodynamics simulation approach to study entrapment of a E. coli-type cell at a no-slip wall. The cell is modeled by a spherocylindrical body with several explicit helical flagella. Three stages of the entrapment process can be distinguished: the approaching regime, where a cell swims toward the wall on a nearly straight trajectory; a scattering regime, where the cell touches the wall, with an reorientation; and a surface-swimming regime. Our simulations show that steric interactions may dominate the entrapment process, yet, hydrodynamic interactions slow down the adsorption dynamics close to the boundary and imply a circular motion on the wall. The locomotion of the cell is characterized by a strong wobbling dynamics, with cells preferentially pointing toward the wall.
△ Less
Submitted 2 July, 2020;
originally announced July 2020.
-
Bayesian-Deep-Learning Estimation of Earthquake Location from Single-Station Observations
Authors:
S. Mostafa Mousavi,
Gregory C. Beroza
Abstract:
We present a deep learning method for single-station earthquake location, which we approach as a regression problem using two separate Bayesian neural networks. We use a multi-task temporal-convolutional neural network to learn epicentral distance and P travel time from 1-minute seismograms. The network estimates epicentral distance and P travel time with absolute mean errors of 0.23 km and 0.03 s…
▽ More
We present a deep learning method for single-station earthquake location, which we approach as a regression problem using two separate Bayesian neural networks. We use a multi-task temporal-convolutional neural network to learn epicentral distance and P travel time from 1-minute seismograms. The network estimates epicentral distance and P travel time with absolute mean errors of 0.23 km and 0.03 s respectively, along with their epistemic and aleatory uncertainties. We design a separate multi-input network using standard convolutional layers to estimate the back-azimuth angle, and its epistemic uncertainty. This network estimates the direction from which seismic waves arrive to the station with a mean error of 1 degree. Using this information, we estimate the epicenter, origin time, and depth along with their confidence intervals. We use a global dataset of earthquake signals recorded within 1 degree (~112 km) from the event to build the model and to demonstrate its performance. Our model can predict epicenter, origin time, and depth with mean errors of 7.3 km, 0.4 second, and 6.7 km respectively, at different locations around the world. Our approach can be used for fast earthquake source characterization with a limited number of observations, and also for estimating location of earthquakes that are sparsely recorded -- either because they are small or because stations are widely separated.
△ Less
Submitted 2 December, 2019;
originally announced December 2019.
-
A Machine-Learning Approach for Earthquake Magnitude Estimation
Authors:
S. Mostafa Mousavi,
Gregory C. Beroza
Abstract:
In this study we develop a single-station deep-learning approach for fast and reliable estimation of earthquake magnitude directly from raw waveforms. We design a regressor composed of convolutional and recurrent neural networks that is not sensitive to the data normalization, hence waveform amplitude information can be utilized during the training. Our network can predict earthquake magnitudes wi…
▽ More
In this study we develop a single-station deep-learning approach for fast and reliable estimation of earthquake magnitude directly from raw waveforms. We design a regressor composed of convolutional and recurrent neural networks that is not sensitive to the data normalization, hence waveform amplitude information can be utilized during the training. Our network can predict earthquake magnitudes with an average error close to zero and standard deviation of ~0.2 based on single-station waveforms without instrument response correction. We test the network for both local and duration magnitude scales and show a station-based learning can be an effective approach for improving the performance. The proposed approach has a variety of potential applications from routine earthquake monitoring to early warning systems.
△ Less
Submitted 14 November, 2019;
originally announced November 2019.
-
Seismic Signal Denoising and Decomposition Using Deep Neural Networks
Authors:
Weiqiang Zhu,
S. Mostafa Mousavi,
Gregory C. Beroza
Abstract:
Denoising and filtering are widely used in routine seismic-data-processing to improve the signal-to-noise ratio (SNR) of recorded signals and by doing so to improve subsequent analyses. In this paper we develop a new denoising/decomposition method, DeepDenoiser, based on a deep neural network. This network is able to learn simultaneously a sparse representation of data in the time-frequency domain…
▽ More
Denoising and filtering are widely used in routine seismic-data-processing to improve the signal-to-noise ratio (SNR) of recorded signals and by doing so to improve subsequent analyses. In this paper we develop a new denoising/decomposition method, DeepDenoiser, based on a deep neural network. This network is able to learn simultaneously a sparse representation of data in the time-frequency domain and a non-linear function that maps this representation into masks that decompose input data into a signal of interest and noise (defined as any non-seismic signal). We show that DeepDenoiser achieves impressive denoising of seismic signals even when the signal and noise share a common frequency band. Our method properly handles a variety of colored noise and non-earthquake signals. DeepDenoiser can significantly improve the SNR with minimal changes in the waveform shape of interest, even in presence of high noise levels. We demonstrate the effect of our method on improving earthquake detection. There are clear applications of DeepDenoiser to seismic imaging, micro-seismic monitoring, and preprocessing of ambient noise data. We also note that potential applications of our approach are not limited to these applications or even to earthquake data, and that our approach can be adapted to diverse signals and applications in other settings.
△ Less
Submitted 6 November, 2018;
originally announced November 2018.
-
Clustering of Janus Particles in Optical Potential Driven by Hydrodynamic Fluxes
Authors:
S. Masoumeh Mousavi,
Sabareesh K. P. Velu,
Agnese Callegari,
Luca Biancofiore,
Giovanni Volpe
Abstract:
Self-organisation is driven by the interactions between the individual components of a system mediated by the environment, and is one of the most important strategies used by many biological systems to develop complex and functional structures. Furthermore, biologically-inspired self-organisation offers opportunities to develop the next generation of materials and devices for electronics, photonic…
▽ More
Self-organisation is driven by the interactions between the individual components of a system mediated by the environment, and is one of the most important strategies used by many biological systems to develop complex and functional structures. Furthermore, biologically-inspired self-organisation offers opportunities to develop the next generation of materials and devices for electronics, photonics and nanotechnology. In this work, we demonstrate experimentally that a system of Janus particles (silica microspheres half-coated with gold) aggregates into clusters in the presence of a Gaussian optical potential and disaggregates when the optical potential is switched off. We show that the underlying mechanism is the existence of a hydrodynamic flow induced by a temperature gradient generated by the light absorption at the metallic patches on the Janus particles. We also perform simulations, which agree well with the experiments and whose results permit us to clarify the underlying mechanism. The possibility of hydrodynamic-flux-induced reversible clustering may have applications in the fields of drug delivery, cargo transport, bioremediation and biopatterning.
△ Less
Submitted 5 November, 2018;
originally announced November 2018.
-
Effect of quantum interference on the optical properties of a three-level V-type atomic system beyond the two-photon resonance condition
Authors:
S. M. Mousavi,
L. Safari,
M. Mahmoudi,
M. Sahrai
Abstract:
The effect of quantum interference on the optical properties of a pumped-probe three-level V-type atomic system is investigated. The probe absorption, dispersion, group index and optical bistability beyond the two-photon resonance condition are discussed. It is found that the optical properties of a medium in the frequency of the probe field, in general, are phase independent. The phase dependence…
▽ More
The effect of quantum interference on the optical properties of a pumped-probe three-level V-type atomic system is investigated. The probe absorption, dispersion, group index and optical bistability beyond the two-photon resonance condition are discussed. It is found that the optical properties of a medium in the frequency of the probe field, in general, are phase independent. The phase dependence arises from a scattering of the coupling field into the probe field at a frequency which in general differs from the probe field frequency. It is demonstrated that beyond the two-photon resonance condition the phase sensitivity of the medium will disappear.
△ Less
Submitted 27 April, 2012;
originally announced April 2012.