-
A robust synthetic data generation framework for machine learning in High-Resolution Transmission Electron Microscopy (HRTEM)
Authors:
Luis Rangel DaCosta,
Katherine Sytwu,
Catherine Groschner,
Mary Scott
Abstract:
Machine learning techniques are attractive options for developing highly-accurate automated analysis tools for nanomaterials characterization, including high-resolution transmission electron microscopy (HRTEM). However, successfully implementing such machine learning tools can be difficult due to the challenges in procuring sufficiently large, high-quality training datasets from experiments. In th…
▽ More
Machine learning techniques are attractive options for developing highly-accurate automated analysis tools for nanomaterials characterization, including high-resolution transmission electron microscopy (HRTEM). However, successfully implementing such machine learning tools can be difficult due to the challenges in procuring sufficiently large, high-quality training datasets from experiments. In this work, we introduce Construction Zone, a Python package for rapidly generating complex nanoscale atomic structures, and develop an end-to-end workflow for creating large simulated databases for training neural networks. Construction Zone enables fast, systematic sampling of realistic nanomaterial structures, and can be used as a random structure generator for simulated databases, which is important for generating large, diverse synthetic datasets. Using HRTEM imaging as an example, we train a series of neural networks on various subsets of our simulated databases to segment nanoparticles and holistically study the data curation process to understand how various aspects of the curated simulated data -- including simulation fidelity, the distribution of atomic structures, and the distribution of imaging conditions -- affect model performance across several experimental benchmarks. Using our results, we are able to achieve state-of-the-art segmentation performance on experimental HRTEM images of nanoparticles from several experimental benchmarks and, further, we discuss robust strategies for consistently achieving high performance with machine learning in experimental settings using purely synthetic data.
△ Less
Submitted 12 September, 2023;
originally announced September 2023.
-
Generalization Across Experimental Parameters in Machine Learning Analysis of High Resolution Transmission Electron Microscopy Datasets
Authors:
Katherine Sytwu,
Luis Rangel DaCosta,
Mary C. Scott
Abstract:
Neural networks are promising tools for high-throughput and accurate transmission electron microscopy (TEM) analysis of nanomaterials, but are known to generalize poorly on data that is "out-of-distribution" from their training data. Given the limited set of image features typically seen in high-resolution TEM imaging, it is unclear which images are considered out-of-distribution from others. Here…
▽ More
Neural networks are promising tools for high-throughput and accurate transmission electron microscopy (TEM) analysis of nanomaterials, but are known to generalize poorly on data that is "out-of-distribution" from their training data. Given the limited set of image features typically seen in high-resolution TEM imaging, it is unclear which images are considered out-of-distribution from others. Here, we investigate how the choice of metadata features in the training dataset influences neural network performance, focusing on the example task of nanoparticle segmentation. We train and validate neural networks across curated, experimentally-collected high-resolution TEM image datasets of nanoparticles under controlled imaging and material parameters, including magnification, dosage, nanoparticle diameter, and nanoparticle material. Overall, we find that our neural networks are not robust across microscope parameters, but do generalize across certain sample parameters. Additionally, data preprocessing heavily influences the generalizability of neural networks trained on nominally similar datasets. Our results highlight the need to understand how dataset features affect deployment of data-driven algorithms.
△ Less
Submitted 20 June, 2023;
originally announced June 2023.
-
Understanding the Influence of Receptive Field and Network Complexity in Neural-Network-Guided TEM Image Analysis
Authors:
Katherine Sytwu,
Catherine Groschner,
Mary C. Scott
Abstract:
Trained neural networks are promising tools to analyze the ever-increasing amount of scientific image data, but it is unclear how to best customize these networks for the unique features in transmission electron micrographs. Here, we systematically examine how neural network architecture choices affect how neural networks segment, or pixel-wise separate, crystalline nanoparticles from amorphous ba…
▽ More
Trained neural networks are promising tools to analyze the ever-increasing amount of scientific image data, but it is unclear how to best customize these networks for the unique features in transmission electron micrographs. Here, we systematically examine how neural network architecture choices affect how neural networks segment, or pixel-wise separate, crystalline nanoparticles from amorphous background in transmission electron microscopy (TEM) images. We focus on decoupling the influence of receptive field, or the area of the input image that contributes to the output decision, from network complexity, which dictates the number of trainable parameters. We find that for low-resolution TEM images which rely on amplitude contrast to distinguish nanoparticles from background, the receptive field does not significantly influence segmentation performance. On the other hand, for high-resolution TEM images which rely on a combination of amplitude and phase contrast changes to identify nanoparticles, receptive field is a key parameter for increased performance, especially in images with minimal amplitude contrast. Our results provide insight and guidance as to how to adapt neural networks for applications with TEM datasets.
△ Less
Submitted 8 April, 2022;
originally announced April 2022.
-
Driving energetically-unfavorable dehydrogenation dynamics with plasmonics
Authors:
Katherine Sytwu,
Michal Vadai,
Fariah Hayee,
Daniel K. Angell,
Alan Dai,
Jefferson Dixon,
Jennifer Dionne
Abstract:
Nanoparticle surface structure and geometry generally dictate where chemical transformations occur, with the low-coordination-number, high-radius-of-curvature sites being energetically-preferred. Here, we show how optical excitation of plasmons enables spatially-controlled chemical transformations, including access to sites which, without illumination, would be energetically-unfavorable. We design…
▽ More
Nanoparticle surface structure and geometry generally dictate where chemical transformations occur, with the low-coordination-number, high-radius-of-curvature sites being energetically-preferred. Here, we show how optical excitation of plasmons enables spatially-controlled chemical transformations, including access to sites which, without illumination, would be energetically-unfavorable. We design a crossed-bar Au-PdHx antenna-reactor system that localizes electromagnetic enhancement away from the innately reactive PdHx nanorod tips. Using optically-coupled in situ environmental transmission electron microscopy, we track the dehydrogenation of individual antenna-reactor pairs with varying optical illumination intensity, wavelength, and hydrogen pressure. Our in situ experiments show that plasmons enable new catalytic sites, including hydrogenation dissociation at the nanorod faces. Molecular dynamics simulations confirm that these new nucleation sites are energetically unfavorable in equilibrium and only accessible via tailored plasmonic excitation.
△ Less
Submitted 10 September, 2020;
originally announced September 2020.
-
Weakly Explosive Percolation in Directed Networks
Authors:
Shane Squires,
Katherine Sytwu,
Diego Alcala,
Thomas Antonsen,
Edward Ott,
Michelle Girvan
Abstract:
Percolation, the formation of a macroscopic connected component, is a key feature in the description of complex networks. The dynamical properties of a variety of systems can be understood in terms of percolation, including the robustness of power grids and information networks, the spreading of epidemics and forest fires, and the stability of gene regulatory networks. Recent studies have shown th…
▽ More
Percolation, the formation of a macroscopic connected component, is a key feature in the description of complex networks. The dynamical properties of a variety of systems can be understood in terms of percolation, including the robustness of power grids and information networks, the spreading of epidemics and forest fires, and the stability of gene regulatory networks. Recent studies have shown that if network edges are added "competitively" in undirected networks, the onset of percolation is abrupt or "explosive." The unusual qualitative features of this phase transition have been the subject of much recent attention. Here we generalize this previously studied network growth process from undirected networks to directed networks and use finite-size scaling theory to find several scaling exponents. We find that this process is also characterized by a very rapid growth in the giant component, but that this growth is not as sudden as in undirected networks.
△ Less
Submitted 20 June, 2013; v1 submitted 9 March, 2013;
originally announced March 2013.