Search | arXiv e-print repository

Optimal Experimental Design Criteria for Data-Consistent Inversion

Authors: Troy Butler, John Jakeman, Michael Pilosov, Scott Walsh, Timothy Wildey

Abstract: The ability to design effective experiments is crucial for obtaining data that can substantially reduce the uncertainty in the predictions made using computational models. An optimal experimental design (OED) refers to the choice of a particular experiment that optimizes a particular design criterion, e.g., maximizing a utility function, which measures the information content of the data. However, traditional approaches for optimal experimental design typically require solving a large number of computationally intensive inverse problems to find the data that maximizes the utility function. Here, we introduce two novel OED criteria that are specifically crafted for the data-consistent inversion (DCI) framework, but do not require solving inverse problems. DCI is a specific approach for solving a class of stochastic inverse problems by constructing a pullback measure on uncertain parameters from an observed probability measure on the outputs of a quantity of interest (QoI) map. While expected information gain (EIG) has been used for both DCI and Bayesian-based OED, the characteristics and properties of DCI solutions differ from those of solutions to Bayesian inverse problems, which should be reflected in the OED criteria. The new design criteria developed in this study, called the expected scaling effect and the expected skewness effect, leverage the geometric structure of pre-images associated with observable data sets, allowing for an intuitive and computationally efficient approach to OED. These criteria utilize singular value computations derived from sampled and approximated Jacobians of the experimental designs. We present both simultaneous and sequential (greedy) formulations of OED based on these innovative criteria. Numerical results demonstrate the effectiveness of our approach for solving stochastic inverse problems.

Submitted 13 June, 2025; originally announced June 2025.

arXiv:2504.14320 [pdf, other]

Expanding the Generative AI Design Space through Structured Prompting and Multimodal Interfaces

Authors: Nimisha Karnatak, Adrien Baranes, Rob Marchant, Huinan Zeng, Tríona Butler, Kristen Olson

Abstract: Text-based prompting remains the predominant interaction paradigm in generative AI, yet it often introduces friction for novice users such as small business owners (SBOs), who struggle to articulate creative goals in domain-specific contexts like advertising. Through a formative study with six SBOs in the United Kingdom, we identify three key challenges: difficulties in expressing brand intuition through prompts, limited opportunities for fine-grained adjustment and refinement during and after content generation, and the frequent production of generic content that lacks brand specificity. In response, we present ACAI (AI Co-Creation for Advertising and Inspiration), a multimodal generative AI tool designed to support novice designers by moving beyond traditional prompt interfaces. ACAI features a structured input system composed of three panels: Branding, Audience and Goals, and the Inspiration Board. These inputs allow users to convey brand-relevant context and visual preferences. This work contributes to HCI research on generative systems by showing how structured interfaces can foreground user-defined context, improve alignment, and enhance co-creative control in novice creative workflows.

Submitted 22 April, 2025; v1 submitted 19 April, 2025; originally announced April 2025.

Comments: Accepted at CHI'25 Workshop on Designing and Developing User Interfaces with AI

arXiv:2503.06729 [pdf, other]

ACAI for SBOs: AI Co-creation for Advertising and Inspiration for Small Business Owners

Authors: Nimisha Karnatak, Adrien Baranes, Rob Marchant, Triona Butler, Kristen Olson

Abstract: Small business owners (SBOs) often lack the resources and design experience needed to produce high-quality advertisements. To address this, we developed ACAI (AI Co-Creation for Advertising and Inspiration), a GenAI-powered multimodal advertisement creation tool, and conducted a user study with 16 SBOs in London to explore their perceptions of and interactions with ACAI in advertisement creation. Our findings reveal that structured inputs enhance user agency and control while improving AI outputs by facilitating better brand alignment, enhancing AI transparency, and offering scaffolding that assists novice designers, such as SBOs, in formulating prompts. We also found that ACAI's multimodal interface bridges the design skill gap for SBOs who have a clear advertisement vision but lack the design jargon necessary for effective prompting. Building on our findings, we propose three capabilities: contextual intelligence, adaptive interactions, and data management, with corresponding design recommendations to advance the co-creative attributes of AI-mediated design tools.

Submitted 9 March, 2025; originally announced March 2025.

arXiv:2503.01761 [pdf]

A comprehensive and reliable protocol for manual segmentation of the human claustrum using high-resolution MRI

Authors: Steven Seung-Suk Kang, Joseph Bodenheimer, Kayley Morris, Tracey Butler

Abstract: The claustrum is a thin gray matter structure in each brain hemisphere, characterized by exceptionally high connectivity with nearly all brain regions. Despite extensive animal studies on its anatomy and function and growing evidence of claustral deficits in neuropsychiatric disorders, its specific roles in normal and abnormal human brain function remain largely unknown. This is primarily due to its thin and complex morphology, which limits accurate anatomical delineation and neural activity isolation in conventional in vivo neuroimaging. To facilitate future neuroimaging studies, we developed a comprehensive and reliable manual segmentation protocol based on a cellular-resolution brain atlas and high-resolution (0.7^3 mm) MRI data. The protocols involve detailed guidelines to delineate the entire claustrum, including the inferior parts that have not been clearly described in earlier MRI studies. Additionally, we propose a geometric method to parcellate the claustrum into three subregions (the dorsal, ventral, and temporal claustrum) along the superior-to-inferior axis. The mean bilateral claustrum volume in 10 young adults was 3307.5 mm^3, approximately 0.21% of total intracranial volume. Our segmentation protocol demonstrated high inter- and intra-rater reliability (ICC > 0.89, DSC > 0.85), confirming its replicability. This comprehensive and reliable manual segmentation protocol offers a robust foundation for anatomically precise neuroimaging investigations of the human claustrum.

Submitted 15 May, 2025; v1 submitted 3 March, 2025; originally announced March 2025.

arXiv:2412.10149 [pdf, ps, other]

Learning Radical Excited States from Sparse Data

Authors: Jingkun Shen, Lucy E. Walker, Kevin Ma, James D. Green, Hugo Bronstein, Keith T. Butler, Timothy J. H. Hele

Abstract: Emissive organic radicals are currently of great interest for their potential use in the next generation of highly efficient organic light emitting diode (OLED) devices and as molecular qubits. However, simulating their optoelectronic properties is challenging, largely due to spin-contamination and the multireference character of their excited states. Here we present a data-driven approach where, for the first time, the excited electronic states of organic radicals are learned directly from experimental excited state data, using a much smaller amount of data than typically required by Machine Learning. We adopt ExROPPP, a fast and spin-pure semiempirical method for calculation of the excited states of radicals, as a surrogate physical model for which we learn the optimal set of parameters. To achieve this we compile the largest known database of organic radical geometries and their UV-vis data, which we use to train our model. Our trained model gives Root Mean Square (RMS) and mean absolute errors for excited state energies of 0.24 and 0.16 eV respectively, improving hugely over ExROPPP with literature parameters. Four new organic radicals are synthesised and we test the model on their spectra, finding even lower errors and similar correlation as for the testing set. This model paves the way for the high throughput discovery of next generation radical-based optoelectronics.

Submitted 12 June, 2025; v1 submitted 13 December, 2024; originally announced December 2024.

arXiv:2407.13814 [pdf, ps, other]

Building Population-Informed Priors for Bayesian Inference Using Data-Consistent Stochastic Inversion

Authors: Rebekah D. White, John D. Jakeman, Tim Wildey, Troy Butler

Abstract: Bayesian inference provides a powerful tool for leveraging observational data to inform model predictions and uncertainties. However, when such data is limited, Bayesian inference may not adequately constrain uncertainty without the use of highly informative priors. Common approaches for constructing informative priors typically rely on either assumptions or knowledge of the underlying physics, which may not be available in all scenarios. In this work, we consider the scenario where data are available on a population of assets/individuals, which occurs in many problem domains such as biomedical or digital twin applications, and leverage this population-level data to systematically constrain the Bayesian prior and subsequently improve individualized inferences. The approach proposed in this paper is based upon a recently developed technique known as data-consistent inversion (DCI) for constructing a pullback probability measure. Succinctly, we utilize DCI to build population-informed priors for subsequent Bayesian inference on individuals. While the approach is general and applies to nonlinear maps and arbitrary priors, we prove that for linear inverse problems with Gaussian priors, the population-informed prior produces an increase in the information gain as measured by the determinant and trace of the inverse posterior covariance. We also demonstrate that the Kullback-Leibler divergence often improves with high probability. Numerical results, including linear-Gaussian examples and one inspired by digital twins for additively manufactured assets, indicate that there is significant value in using these population-informed priors.

Submitted 24 June, 2025; v1 submitted 18 July, 2024; originally announced July 2024.

Comments: Corrected error in Algorithm 1. Small changes to illustrative examples and introductory text

arXiv:2406.13142 [pdf, other]

doi 10.1038/s41524-024-01486-1

Optimal pre-train/fine-tune strategies for accurate material property predictions

Authors: Reshma Devi, Keith T. Butler, Gopalakrishnan Sai Gautam

Abstract: Overcoming the challenge of limited data availability within materials science is crucial for the broad-based applicability of machine learning within materials science. One pathway to overcome this limited data availability is to use the framework of transfer learning (TL), where a pre-trained (PT) machine learning model (on a larger dataset) can be fine-tuned (FT) on a target (typically smaller) dataset. Our study systematically explores the effectiveness of various PT/FT strategies to learn and predict material properties with limited data. Specifically, we leverage graph neural networks (GNNs) to PT/FT on seven diverse curated materials datasets, encompassing sizes ranging from 941 to 132,752 datapoints. We consider datasets that cover a spectrum of material properties, ranging from band gaps (electronic) to formation energies (thermodynamic) and shear moduli (mechanical). We study the influence of PT and FT dataset sizes, strategies that can be employed for FT, and other hyperparameters on pair-wise TL among the datasets considered. We find our pair-wise PT-FT models to consistently outperform models trained from scratch on the target datasets. Importantly, we develop a GNN framework that is simultaneously PT on multiple properties (MPT), enabling the construction of generalized GNN models. Our MPT models outperform pair-wise PT-FT models on several datasets considered, and more significantly, on a 2D material band gap dataset that is completely out-of-distribution from the PT datasets. Finally, we expect our PT/FT and MPT frameworks to be generalizable to other GNNs and materials properties, which can accelerate materials design and discovery for various applications.

Submitted 18 June, 2024; originally announced June 2024.

arXiv:2405.08307 [pdf, other]

Sequential Maximal Updated Density Parameter Estimation for Dynamical Systems with Parameter Drift

Authors: Carlos del-Castillo-Negrete, Rylan Spence, Troy Butler, Clint Dawson

Abstract: We present a novel method for generating sequential parameter estimates and quantifying epistemic uncertainty in dynamical systems within a data-consistent (DC) framework. The DC framework differs from traditional Bayesian approaches due to the incorporation of the push-forward of an initial density, which performs selective regularization in parameter directions not informed by the data in the resulting updated density. This extends a previous study that included the linear Gaussian theory within the DC framework and introduced the maximal updated density (MUD) estimate as an alternative to both least squares and maximum a posteriori (MAP) estimates. In this work, we introduce algorithms for operational settings of MUD estimation in real or near-real time where spatio-temporal datasets arrive in packets to provide updated estimates of parameters and identify potential parameter drift. Computational diagnostics within the DC framework prove critical for evaluating (1) the quality of the DC update and MUD estimate and (2) the detection of parameter value drift. The algorithms are applied to estimate (1) wind drag parameters in a high-fidelity storm surge model, (2) thermal diffusivity field for a heat conductivity problem, and (3) changing infection and incubation rates of an epidemiological model.

Submitted 14 May, 2024; originally announced May 2024.

Comments: 29 pages, 9 Figures, Code available at https://github.com/UT-CHG/pyDCI

arXiv:2404.11886 [pdf, other]

A Distributions-based Approach for Data-Consistent Inversion

Authors: Kirana Bergstrom, Troy Butler, Tim Wildey

Abstract: We formulate a novel approach to solve a class of stochastic problems, referred to as data-consistent inverse (DCI) problems, which involve the characterization of a probability measure on the parameters of a computational model whose subsequent push-forward matches an observed probability measure on specified quantities of interest (QoI) typically associated with the outputs from the computational model. Whereas prior DCI solution methodologies focused on either constructing non-parametric estimates of the densities or the probabilities of events associated with the pre-image of the QoI map, we develop and analyze a constrained quadratic optimization approach based on estimating push-forward measures using weighted empirical distribution functions. The method proposed here is more suitable for low-data regimes or high-dimensional problems than the density-based method, as well as for problems where the probability measure does not admit a density. Numerical examples are included to demonstrate the performance of the method and to compare with the density-based approach where applicable.

Submitted 18 April, 2024; originally announced April 2024.

Comments: 26 pages, 19 figures

MSC Class: 28A50; 65K10; 62G07

arXiv:2404.03538 [pdf]

doi 10.3390/educsci14030219

Quantum Science and Technologies in K-12: Supporting Teachers to Integrate Quantum in STEM Classrooms

Authors: Nancy Holincheck, Jessica L. Rosenberg, Xiaolu Zhang, Tiffany Butler, Michele Colandene, Benjamin W. Dreyfus

Abstract: Quantum science and computing represent a vital intersection between science and technology, gaining increasing importance in modern society. There is a pressing need to incorporate these concepts into the K-12 curriculum, equipping new generations with the tools to navigate and thrive in an evolving technological landscape. This study explores the professional learning of K-12 teachers (n = 49) related to quantum concepts and pedagogy. We used open-ended surveys, field notes, workshop artifacts, and interviews to examine teachers' perceptions of quantum and how they made connections between quantum and their curriculum. Our data reveal that most teachers were excited and interested in teaching quantum but were aware of potential barriers and concerns that might get in the way of teaching quantum. We found that teachers readily identified connections to math and science in their curriculum, but only a few made connections to computing. Enthusiasm for teaching quantum concepts was found in both elementary and secondary educators, suggesting a widespread recognition of its importance in preparing students for a future where quantum technology is a fundamental aspect of their lives and careers.

Submitted 4 April, 2024; originally announced April 2024.

Comments: 15 pages

Journal ref: Educ. Sci. 2024, 14, 219

arXiv:2403.03233 [pdf, other]

From Displacements to Distributions: A Machine-Learning Enabled Framework for Quantifying Uncertainties in Parameters of Computational Models

Authors: Taylor Roper, Harri Hakula, Troy Butler

Abstract: This work presents novel extensions for combining two frameworks for quantifying both aleatoric (i.e., irreducible) and epistemic (i.e., reducible) sources of uncertainties in the modeling of engineered systems. The data-consistent (DC) framework poses an inverse problem and solution for quantifying aleatoric uncertainties in terms of pullback and push-forward measures for a given Quantity of Interest (QoI) map. Unfortunately, a pre-specified QoI map is not always available a priori to the collection of data associated with system outputs. The data themselves are often polluted with measurement errors (i.e., epistemic uncertainties), which complicates the process of specifying a useful QoI. The Learning Uncertain Quantities (LUQ) framework defines a formal three-step machine-learning enabled process for transforming noisy datasets into samples of a learned QoI map to enable DC-based inversion. We develop a robust filtering step in LUQ that can learn the most useful quantitative information present in spatio-temporal datasets. The learned QoI map transforms simulated and observed datasets into distributions to perform DC-based inversion. We also develop a DC-based inversion scheme that iterates over time as new spatial datasets are obtained and utilizes quantitative diagnostics to identify both the quality and impact of inversion at each iteration. Reproducing Kernel Hilbert Space theory is leveraged to mathematically analyze the learned QoI map and develop a quantitative sufficiency test for evaluating the filtered data. An illustrative example is utilized throughout while the final two examples involve the manufacturing of shells of revolution to demonstrate various aspects of the presented frameworks.

Submitted 4 March, 2024; originally announced March 2024.

Comments: 35 pages

MSC Class: 28A50; 60-04; 60-08

arXiv:2402.02236 [pdf]

doi 10.1103/PhysRevPhysEducRes.20.010114

Role of Mentorship, Career Conceptualization, and Leadership in Developing Women's Physics Identity and Belonging

Authors: Jessica L. Rosenberg, Nancy Holincheck, Kathryn Fernández, Benjamin W. Dreyfus, Fardousa Wardere, Stephanie Stehle, Tiffany N. Butler

Abstract: The percentage of women receiving bachelor's degrees in physics in the U.S. lags well behind that of men, and women leave the major at higher rates. Achieving equity in physics will mean that women stay in physics at the same rates as men, but this will require changes in the culture and support structures. A strong sense of belonging can lead to higher retention rates, so interventions meant to increase dimensions of physics identity (interest, recognition, performance, and competence) may increase persistence overall and increase women's retention differentially. We describe our model in which mentorship, an understanding of career options (career conceptualization), and leadership are inputs into the development of these dimensions of physics identity. This paper includes preliminary results from a qualitative study that aims to better understand how career conceptualization, leadership, and mentorship contribute to the development of physics identity and belonging. We report results from a survey of 15 undergraduate physics students which was followed up by interviews with 5 of those students. The students were from a small private liberal arts college in the midwest region of the U.S. and a large public university in the southeast region of the U.S. classified as a Hispanic-serving institution (HSI). With respect to mentorship, we found that it could provide critical support for students' engagement in the physics community. Leadership experiences have not previously been positioned as an important input into identity, yet we found that they helped women in physics feel more confident, contributing to their recognition of themselves as physics people. While the data on how career conceptualization contributed to the building of identity is limited, there are some connections to recognition and competence, and it will be an interesting avenue of future exploration.

Submitted 3 February, 2024; originally announced February 2024.

Comments: 15 pages, 1 figure, Physical Review Physics Education Research, in press

Journal ref: Phys. Rev. Phys. Educ. Res. 20, 010114 11 March 2024

arXiv:2312.05294 [pdf, other]

doi 10.1002/aenm.202304230

Effects of Grain Boundaries and Surfaces on Electronic and Mechanical Properties of Solid Electrolytes

Authors: Weihang Xie, Zeyu Deng, Zhengyu Liu, Theodosios Famprikis, Keith T. Butler, Pieremanuele Canepa

Abstract: Extended defects, including exposed surfaces and grain boundaries, are critical to the properties of polycrystalline solid electrolytes in all-solid-state batteries (ASSBs). These defects can significantly alter the mechanical and electronic properties of solid electrolytes, with direct manifestations on the performance of ASSBs. Here, by building a library of 590 surfaces and grain boundaries of 11 relevant solid electrolytes (including halides, oxides, and sulfides), their electronic, mechanical, and thermodynamic characteristics are linked to the functional properties of polycrystalline solid electrolytes. It is found that the energy required to mechanically "separate" grain boundaries can be significantly lower than in the bulk region of materials, which can trigger preferential cracking of solid electrolyte particles in the grain boundary regions. The brittleness of ceramic solid electrolytes, inferred from the predicted low fracture toughnesses at the grain boundaries, contributes to their cracking under local pressure imparted by lithium or sodium penetration in the grain boundaries. Extended defects of solid electrolytes introduce new electronic "interfacial" states within bandgaps of solid electrolytes. These interfacial states alter and possibly increase locally the availability of free electrons and holes in solid electrolytes. Factoring in effects arising from extended defects appears crucial to explain electrochemical and -mechanical observations in ASSBs.

Submitted 4 January, 2024; v1 submitted 8 December, 2023; originally announced December 2023.

Journal ref: Adv. Energy Mater., 2304230 (2024)

arXiv:2307.13669 [pdf, other]

Order parameters for gauge invariant condensation far from equilibrium

Authors: Jürgen Berges, Kirill Boguslavski, Lillian de Bruin, Tara Butler, Jan M. Pawlowski

Abstract: Nuclear collisions at sufficiently high energies are expected to produce far-from-equilibrium matter with a high density of gluons at early times. We show gauge condensation, which occurs as a consequence of the large density of gluons. To identify this condensation phenomenon, we construct two local gauge-invariant observables that carry the macroscopic zero mode of the gauge condensate. The first order parameter for gauge condensation investigated here is the correlator of the spatial Polyakov loop. We also consider, for the first time, the correlator of the gauge invariant scalar field, associated to the exponent of the Polyakov loop. Using real-time lattice simulations of classical-statistical $SU(2)$ gauge theory, we find gauge condensation on a system-size dependent time scale $t_{\text{cond}} \sim L^{1/\zeta}$ with a universal scaling exponent $\zeta$. Furthermore, we suggest an effective theory formulation describing the dynamics using one of the order parameters identified. The formation of a condensate at early times may have intriguing implications for the early stages in heavy ion collisions.

Submitted 25 July, 2023; originally announced July 2023.

Comments: 11 pages, 9 figures, 1 table

arXiv:2307.04340 [pdf, other]

Crystal Structure Generation with Autoregressive Large Language Modeling

Authors: Luis M. Antunes, Keith T. Butler, Ricardo Grau-Crespo

Abstract: The generation of plausible crystal structures is often the first step in predicting the structure and properties of a material from its chemical composition. Quickly generating and predicting inorganic crystal structures is important for the discovery of new materials, which can target applications such as energy or electronic devices. However, most current methods for crystal structure prediction are computationally expensive, slowing the pace of innovation. Seeding structure prediction algorithms with quality generated candidates can overcome a major bottleneck. Here, we introduce CrystaLLM, a methodology for the versatile generation of crystal structures, based on the autoregressive large language modeling (LLM) of the Crystallographic Information File (CIF) format. Trained on millions of CIF files, CrystaLLM focuses on modeling crystal structures through text. CrystaLLM can produce plausible crystal structures for a wide range of inorganic compounds unseen in training, as demonstrated by ab initio simulations. The integration with predictors of formation energy permits the use of a Monte Carlo Tree Search algorithm to improve the generation of meaningful structures. Our approach challenges conventional representations of crystals, and demonstrates the potential of LLMs for learning effective 'world models' of crystal chemistry, which will lead to accelerated discovery and innovation in materials science.

Submitted 12 February, 2024; v1 submitted 10 July, 2023; originally announced July 2023.

Comments: Added new results and supplementary information

arXiv:2307.00784 [pdf, other]

Element similarity in high-dimensional materials representations

Authors: Anthony Onwuli, Ashish V. Hegde, Kevin Nguyen, Keith T. Butler, Aron Walsh

Abstract: The traditional display of elements in the periodic table is convenient for the study of chemistry and physics. However, the atomic number alone is insufficient for training statistical machine learning models to describe and extract composition-structure-property relationships. Here, we assess the similarity and correlations contained within high-dimensional local and distributed representations of the chemical elements, as implemented in an open-source Python package ElementEmbeddings. These include element vectors of up to 200 dimensions derived from known physical properties, crystal structure analysis, natural language processing, and deep learning models. A range of distance measures are compared and a clustering of elements into familiar groups is found using dimensionality reduction techniques. The cosine similarity is used to assess the utility of these metrics for crystal structure prediction, showing that they can outperform the traditional radius ratio rules for the structural classification of AB binary solids.

Submitted 24 August, 2023; v1 submitted 3 July, 2023; originally announced July 2023.

Comments: 7 pages, 8 figures

arXiv:2306.14401 [pdf, ps, other]

On the distribution of sensitivities of symmetric Boolean functions

Authors: Jon T. Butler, Tsutomu Sasao, Shinobu Nagayama

Abstract: A Boolean function $f({\vec x})$ is sensitive to bit $x_i$ if there is at least one input vector $\vec x$ and one bit $x_i$ in $\vec x$, such that changing $x_i$ changes $f$. A function has sensitivity $s$ if among all input vectors, the largest number of bits to which $f$ is sensitive is $s$. We count the $n$-variable symmetric Boolean functions that have maximum sensitivity. We show that most such functions have the largest possible sensitivity, $n$. This suggests sensitivity is limited as a complexity measure for symmetric Boolean functions.

Submitted 25 June, 2023; originally announced June 2023.

Comments: 5 pages, 0 figures. The submitted paper is a journal version of "Enumeration of Symmetric Boolean Functions By Sensitivity" by J. Butler, T. Sasao, and S. Nagayama, presented at the Reed-Muller Workshop, Matsue, Japan on May 24, 2023. Paper was presented, but not distributed. Authors retained copyright.

arXiv:2301.11419 [pdf, other]

Efficiently predicting high resolution mass spectra with graph neural networks

Authors: Michael Murphy, Stefanie Jegelka, Ernest Fraenkel, Tobias Kind, David Healey, Thomas Butler

Abstract: Identifying a small molecule from its mass spectrum is the primary open problem in computational metabolomics. This is typically cast as information retrieval: an unknown spectrum is matched against spectra predicted computationally from a large database of chemical structures. However, current approaches to spectrum prediction model the output space in ways that force a tradeoff between capturing high resolution mass information and tractable learning. We resolve this tradeoff by casting spectrum prediction as a mapping from an input molecular graph to a probability distribution over molecular formulas. We discover that a large corpus of mass spectra can be closely approximated using a fixed vocabulary constituting only 2% of all observed formulas. This enables efficient spectrum prediction using an architecture similar to graph classification - GrAFF-MS - achieving significantly lower prediction error and orders-of-magnitude faster runtime than state-of-the-art methods.

Submitted 26 January, 2023; originally announced January 2023.

arXiv:2212.06444 [pdf, other]

Predicting Thermoelectric Transport Properties from Composition with Attention-based Deep Learning

Authors: Luis M. Antunes, Keith T. Butler, Ricardo Grau-Crespo

Abstract: Thermoelectric materials can be used to construct devices which recycle waste heat into electricity. However, the best known thermoelectrics are based on rare, expensive or even toxic elements, which limits their widespread adoption. To enable deployment on global scales, new classes of effective thermoelectrics are thus required. $\textit{Ab initio}$ models of transport properties can help in the design of new thermoelectrics, but they are still too computationally expensive to be solely relied upon for high-throughput screening in the vast chemical space of all possible candidates. Here, we use models constructed with modern machine learning techniques to scan very large areas of inorganic materials space for novel thermoelectrics, using composition as an input. We employ an attention-based deep learning model, trained on data derived from $\textit{ab initio}$ calculations, to predict a material's Seebeck coefficient, electrical conductivity, and power factor over a range of temperatures and $\textit{n}$- or $\textit{p}$-type doping levels, with surprisingly good performance given the simplicity of the input, and with significantly lower computational cost. The results of applying the model to a space of known and hypothetical binary and ternary selenides reveal several materials that may represent promising thermoelectrics. Our study establishes a protocol for composition-based prediction of thermoelectric behaviour that can be easily enhanced as more accurate theoretical or experimental databases become available.

Submitted 13 December, 2022; originally announced December 2022.

arXiv:2212.04587 [pdf, other]

doi 10.1016/j.cma.2023.115906

Parameter Estimation with Maximal Updated Densities

Authors: Michael Pilosov, Carlos del-Castillo-Negrete, Tian Yu Yen, Troy Butler, Clint Dawson

Abstract: A recently developed measure-theoretic framework solves a stochastic inverse problem (SIP) for models where uncertainties in model output data are predominantly due to aleatoric (i.e., irreducible) uncertainties in model inputs (i.e., parameters). The subsequent inferential target is a distribution on parameters. Another type of inverse problem is to quantify uncertainties in estimates of "true" parameter values under the assumption that such uncertainties should be reduced as more data are incorporated into the problem, i.e., the uncertainty is considered epistemic. A major contribution of this work is the formulation and solution of such a parameter identification problem (PIP) within the measure-theoretic framework developed for the SIP. The approach is novel in that it utilizes a solution to a stochastic forward problem (SFP) to update an initial density only in the parameter directions informed by the model output data. In other words, this method performs "selective regularization" only in the parameter directions not informed by data. The solution is defined by a maximal updated density (MUD) point where the updated density defines the measure-theoretic solution to the PIP. Another significant contribution of this work is the full theory of existence and uniqueness of MUD points for linear maps with Gaussian distributions. Data-constructed Quantity of Interest (QoI) maps are also presented and analyzed for solving the PIP within this measure-theoretic framework as a means of reducing uncertainties in the MUD estimate.
We conclude with a demonstration of the general applicability of the method on two problems involving either spatial or temporal data for estimating uncertain model parameters.

Submitted 19 January, 2023; v1 submitted 8 December, 2022; originally announced December 2022.

Comments: Code: github.com/mathematicalmichael/mud.git

arXiv:2209.02488 [pdf]

Operation of the H- Linac at FNAL

Authors: K. Seiya, T. Butler, D. Jones, V. Kapin, K. Hartman, S. Moua, J. -F. Ostiguy, R. Ridgway, R. Sharankova, B. Stanzil, C. Y. Tan, J. Walters, M. Wesley, M. Mwaniki

Abstract: The Fermi National Accelerator Laboratory (FNAL) Linac has been in operation for 52 years. In approximately four years, it will be replaced by a new 800 MeV superconducting machine, the PIP-II SRF Linac. In the current configuration, the Linac delivers H- ions at 400 MeV and injects protons by charge exchange into the Booster synchrotron. Despite its age, the Linac is the most stable accelerator in the FNAL complex, reliably sending 22 mA in daily operations. We will discuss the status of the operation, beam studies, and plans.

Submitted 6 September, 2022; originally announced September 2022.

Report number: FERMILAB-CONF-22-634-AD

arXiv:2207.13389 [pdf, other]

Versatile Domain Mapping Of Scanning Electron Nanobeam Diffraction Datasets Utilising Variational AutoEncoders and Decoder-Assisted Latent-Space Clustering

Authors: Andy Bridger, William I. F. David, Thomas J. Wood, Mohsen Danaie, Keith T. Butler

Abstract: Advancements in fast electron detectors have enabled the statistically significant sampling of crystal structures on the nanometre scale by means of Scanning Electron Nanobeam Diffraction (SEND). Characterisation of structural similarity across this length scale is key to bridging the gap between local atomic structure (using atomic resolution techniques such as High Resolution Scanning Transmission Electron Microscopy (HR-STEM)) and the macro-scale (using bulk techniques such as powder X-ray and neutron diffraction). The use of the SEND technique allows for structural investigation of a broad range of samples, due to the technique's ability to operate with low electron dosage and its tolerance for sample thickness, relative to HR-STEM. This, coupled with the capacity for data collection over wide areas and the automation of this collection, allows for statistically representative sampling of the microstructure. Also due to these factors, SEND generates large datasets and as a result automated/semi-automated data processing workflows are required to aid in maximal extraction of useful information. As such, this paper outlines a versatile, data-driven approach for producing domain maps, as well as a statistical approach for assessing their applicability. The production of such domain maps for a dataset can help highlight nuance in the microstructure, as well as improve the manageability of that dataset for further investigation.
The workflow outlined utilises a Variational AutoEncoder to identify and learn the sources of variance in the diffraction signal and this, in combination with clustering techniques, is used to produce domain maps for a set of varied example cases. This approach: is agnostic to domain crystallinity; requires no prior knowledge of crystal structure; and does not require the, potentially prohibitive, simulation of a library of appropriate diffraction patterns.

Submitted 27 July, 2022; originally announced July 2022.

arXiv:2207.02980 [pdf, other]

Multi-scale Sinusoidal Embeddings Enable Learning on High Resolution Mass Spectrometry Data

Authors: Gennady Voronov, Rose Lightheart, Joe Davison, Christoph A. Krettler, David Healey, Thomas Butler

Abstract: Small molecules in biological samples are studied to provide information about disease states, environmental toxins, natural product drug discovery, and many other applications. The primary window into the composition of small molecule mixtures is tandem mass spectrometry (MS2), which produces data that are of high sensitivity and part per million resolution. We adopt multi-scale sinusoidal embeddings of the mass data in MS2 designed to meet the challenge of learning from the full resolution of MS2 data. Using these embeddings, we provide a new state of the art model for spectral library search, the standard task for initial evaluation of MS2 data. We also introduce a new task, chemical property prediction from MS2 data, that has natural applications in high-throughput MS2 experiments and show that an average $R^2$ of 80\% for novel compounds can be achieved across 10 chemical properties prioritized by medicinal chemists. We use dimensionality reduction techniques and experiments with different floating point resolutions to show the essential role multi-scale sinusoidal embeddings play in learning from MS2 data.

Submitted 5 May, 2023; v1 submitted 6 July, 2022; originally announced July 2022.

arXiv:2205.10084 [pdf]

Spinel nitride solid solutions: charting properties in the configurational space with explainable machine learning

Authors: Pablo Sánchez-Palencia, Said Hamad, Pablo Palacios, Ricardo Grau-Crespo, Keith T. Butler

Abstract: Ab initio prediction of the variation of properties in the configurational space of solid solutions is computationally very demanding. We present an approach to accelerate these predictions via a combination of density functional theory and machine learning, using the cubic spinel nitride GeSn$_2$N$_4$ as a case study, exploring how formation energy and electronic bandgap are affected by configurational variations. Furthermore, we demonstrate the utility of applying explainable machine learning to understand the crystal chemistry origins of the trends that we observe. Different configuration descriptors (Coulomb matrix eigenspectrum, many-body tensor representation, and cluster correlation function vectors) are combined with different models (linear regression, gradient-boosted decision tree, and multi-layer perceptron) to extrapolate the calculation of ab initio properties from a small set of configurations to the full space with thousands of configurations. We discuss the performance of different descriptors and models. SHAP (SHapley Additive exPlanations) analysis of the machine learning models highlights how values of formation energy are dominated by variations in local crystal structure (single polyhedral environments), while values of electronic bandgap are dominated by variations in more extended structural motifs.
Finally, we demonstrate the usefulness of this approach by constructing structure-property maps, identifying important configurations of GeSn$_2$N$_4$ with extremal properties, as well as by calculating accurate equilibrium properties using configurational averaging.

Submitted 20 May, 2022; originally announced May 2022.

arXiv:2204.05711 [pdf, other]

doi 10.3847/1538-3881/ac6589

HD 83443c: A highly eccentric giant planet on a 22-year orbit

Authors: Adriana Errico, Robert A. Wittenmyer, Jonathan Horner, Zhexing Li, Gregory Mirek Brandt, Stephen R. Kane, Tara Fetherolf, Timothy R. Holt, Brad Carter, Jake T. Clark, Robert P. Butler, Chris G. Tinney, Sarah Ballard, Brendan P. Bowler, John Kielkopf, Huigen Liu, Peter P. Plavchan, Avi Shporer, Hui Zhang, Duncan J. Wright, Brett C. Addison, Matthew W. Mengel, Jack Okumura

Abstract: We report the discovery of a highly eccentric long-period Jovian planet orbiting the hot-Jupiter host HD 83443. By combining radial velocity data from four instruments (AAT/UCLES, Keck/HIRES, HARPS, Minerva-Australis) spanning more than two decades, we find evidence for a planet with $m \sin i = 1.35^{+0.07}_{-0.06}\,M_{\rm Jup}$, moving on an orbit with $a = 8.0 \pm 0.8$ au and eccentricity $e = 0.76 \pm 0.05$. We combine our radial velocity analysis with Gaia eDR3/Hipparcos proper motion anomalies and derive a dynamical mass of $1.5^{+0.5}_{-0.2}\,M_{\rm Jup}$. We perform a detailed dynamical simulation that reveals locations of stability within the system that may harbor additional planets, including stable regions within the habitable zone of the host star. HD 83443 is a rare example of a system hosting a hot Jupiter and an exterior planetary companion. The high eccentricity of HD 83443c suggests that a scattering event may have sent the hot Jupiter to its close orbit while leaving the outer planet on a wide and eccentric path.

Submitted 12 April, 2022; originally announced April 2022.

arXiv:2202.02318 [pdf]

doi 10.1007/978-3-030-69288-9

Self-Organization In Stellar Evolution: Size-Complexity Rule

Authors: Travis Herman Butler, Georgi Yordanov Georgiev

Abstract: Complexity Theory is highly interdisciplinary, therefore any regularities must hold on all levels of organization, independent of the nature of the system. An open question in science is how complex systems self-organize to produce emergent structures and properties, a branch of non-equilibrium thermodynamics. It has long been known that there is a quantity-quality transition in natural systems. This is to say that the properties of a system depend on its size. More recently, this has been termed the size-complexity rule, which means that to increase their size, systems must increase their complexity, and that to increase their complexity they must grow in size. This rule goes under different names in different disciplines and systems of different nature, such as the area-speciation rule, economies of scale, scaling relations (allometric) in biology and for cities, and many others. We apply the size-complexity rule to stars to compare them with other complex systems in order to find universal patterns of self-organization independent of the substrate. Here, as a measure of complexity of a star, we are using the degree of grouping of nucleons into atoms, which reduces nucleon entropy, increases the variety of elements, and changes the structure of the star. As seen in our previous work, complexity, using action efficiency, is in power law proportionality of all other characteristics of a complex system, including its size.
Here we find that, as for the other systems studied, the complexity of stars is in a power law proportionality with their size - the bigger a system is, the higher its level of complexity is - despite differing explosion energies and initial metallicities from simulations and data, which confirms the size-complexity rule and our model.

Submitted 4 February, 2022; originally announced February 2022.

Comments: 20 pages, 3 figures, 11 tables, 9 equations, Conference on Complex Systems CCS2017, Cancun, Mexico, Satellite meeting "Efficiency in complex systems"

Journal ref: Springer Proceedings in Complexity. 2022

arXiv:2201.11161 [pdf]

Co-substituted BiFeO3: electronic, ferroelectric, and thermodynamic properties from first principles

Authors: Shivani Grover, Keith T. Butler, Umesh V Waghmare, Ricardo Grau-Crespo

Abstract: Bismuth ferrite, BiFeO3, is a multiferroic solid that is attracting increasing attention as a potential photocatalytic material, because the ferroelectric polarisation enhances the separation of photogenerated carriers. With the motivation of finding routes to engineer the band gap and the band alignment, while conserving or enhancing the ferroelectric properties, we have investigated the thermodynamic, electronic and ferroelectric properties of BiCo$_x$Fe$_{1-x}$O$_3$ solid solutions, with 0 < x < 0.13, using density functional theory. We show that the band gap can be reduced from 2.9 eV to 2.1 eV by cobalt substitution, while simultaneously increasing the spontaneous polarisation, which is associated with a notably larger Born effective charge of Co compared to Fe cations. We discuss the interaction between Co impurities, which is strongly attractive and would drive the aggregation of Co, as evidenced by Monte Carlo simulations. Phase separation into a Co-rich phase is therefore predicted to be thermodynamically preferred, and the homogeneous solid solution can only exist in metastable form, protected by slow cation diffusion kinetics. Finally, we discuss the band alignment of pure and Co-substituted BiFeO3 with relevant redox potentials, in the context of its applicability in photocatalysis.

Submitted 4 August, 2022; v1 submitted 26 January, 2022; originally announced January 2022.

Comments: Bibliography expanded; typos corrected; improved discussion of photocatalytic applications

arXiv:2112.09795 [pdf]

doi 10.1039/D1TA10860C

Mixed-anion mixed-cation perovskite (FAPbI$_3$)$_{0.875}$(MAPbBr$_3$)$_{0.125}$: an ab-initio molecular dynamics study

Authors: Eduardo Menéndez-Proupin, Shivani Grover, Ana L. Montero-Alejo, Scott D. Midgley, Keith T. Butler, Ricardo Grau-Crespo

Abstract: Mixed-anion mixed-cation perovskites with (FAPbI$_3$)$_{1-x}$(MAPbBr$_3$)$_x$ composition have allowed record efficiencies in photovoltaic solar cells, but their atomic-scale behaviour is not well understood yet, in part because their theoretical modelling requires consideration of complex and interrelated dynamic and disordering effects. We present here an ab initio molecular dynamics investigation of the structural, thermodynamic, and electronic properties of the (FAPbI$_3$)$_{0.875}$(MAPbBr$_3$)$_{0.125}$ perovskite. A special quasi-random structure is proposed to mimic the disorder of both the molecular cations and the halide anions, in a stoichiometry that is close to that of one of today's most efficient perovskite solar cells. We show that the rotation of the organic cations is more strongly hindered in the mixed structure in comparison with the pure compounds. Our analysis suggests that this mixed perovskite is thermodynamically stable against phase separation despite the endothermic mixing enthalpy, due to the large configurational entropy. The electronic properties are investigated by hybrid density functional calculations including spin-orbit coupling in carefully selected representative configurations extracted from the molecular dynamics. Our model, that is validated here against experimental information, provides a more sophisticated understanding of the interplay between dynamic and disordering effects in this important family of photovoltaic materials.

Submitted 17 December, 2021; originally announced December 2021.

Comments: 10 pages, 7 figures

Journal ref: Journal of Materials Chemistry A, 2022

arXiv:2108.12865 [pdf]

doi 10.1039/D1CP05623A

Ultralow Work Function of the Electride Sr$_3$CrN$_3$

Authors: Cuicui Wang, Miaoting Xu, Keith T. Butler, Lee A. Burton

Abstract: Electrides have valence electrons that occupy free space in the crystal structure, making them easier to extract. This feature can be used in catalysis for important reactions that usually require high-temperature and high-pressure environments, such as ammonia synthesis. In this paper, we use density functional theory to investigate the behaviour of interstitial electrons of the 1-dimensional electride Sr$_3$CrN$_3$. We find that the bulk excess electron density persists on introduction of surface terminations, that the crystal termination perpendicular to the 1D free-electron channel is highly stable and we confirm an extremely low work function with hybrid functional methods. Our results indicate that Sr$_3$CrN$_3$ is a potentially important novel catalyst, with accessible, directional and extractable free electron density.

Submitted 29 August, 2021; originally announced August 2021.

Comments: 4 pages, 4 figures

arXiv:2108.02077 [pdf, other]

doi 10.1063/5.0065694

Entropy-based Active Learning of Graph Neural Network Surrogate Models for Materials Properties

Authors: Johannes Allotey, Keith T. Butler, Jeyan Thiyagalingam

Abstract: Graph neural networks, trained on experimental or calculated data, are becoming an increasingly important tool in computational materials science. Networks, once trained, are able to make highly accurate predictions at a fraction of the cost of experiments or first-principles calculations of comparable accuracy. However, these networks typically rely on large databases of labelled experiments to train the model. In scenarios where data is scarce or expensive to obtain this can be prohibitive. By building a neural network that provides a confidence on the predicted properties, we are able to develop an active learning scheme that can reduce the amount of labelled data required, by identifying the areas of chemical space where the model is most uncertain. We present a scheme for coupling a graph neural network with a Gaussian process to featurise solid-state materials and predict properties \textit{including} a measure of confidence in the prediction. We then demonstrate that this scheme can be used in an active learning context to speed up the training of the model, by selecting the optimal next experiment for obtaining a data label. Our active learning scheme can double the rate at which the performance of the model on a test data set improves with additional data compared to choosing the next sample at random. This type of uncertainty quantification and active learning has the potential to open up new areas of materials science, where data are scarce and expensive to obtain, to the transformative power of graph neural networks.

Submitted 13 August, 2021; v1 submitted 4 August, 2021; originally announced August 2021.

arXiv:2107.14664 [pdf, other]

Distributed Representations of Atoms and Materials for Machine Learning

Authors: Luis M. Antunes, Ricardo Grau-Crespo, Keith T. Butler

Abstract: The use of machine learning is becoming increasingly common in computational materials science. To build effective models of the chemistry of materials, useful machine-based representations of atoms and their compounds are required. We derive distributed representations of compounds from their chemical formulas only, via pooling operations of distributed representations of atoms. These compound representations are evaluated on ten different tasks, such as the prediction of formation energy and band gap, and are found to be competitive with existing benchmarks that make use of structure, and even superior in cases where only composition is available. Finally, we introduce a new approach for learning distributed representations of atoms, named SkipAtom, which makes use of the growing information in materials structure databases.

Submitted 30 July, 2021; originally announced July 2021.

arXiv:2107.06830 [pdf, other]

doi 10.1088/1361-6544/ac7d8b

Invariant tori for multi-dimensional integrable hamiltonians coupled to a single thermostat

Authors: Leo T. Butler

Abstract: This paper demonstrates sufficient conditions for the existence of KAM tori in a singly thermostated, integrable hamiltonian system with $n$ degrees of freedom with a focus on the generalized, variable-mass thermostats of order 2--which include the Nosé thermostat, the logistic thermostat of Tapias, Bravetti and Sanders, and the Winkler thermostat. It extends Theorem 3.2 of Legoll, Luskin & Moeckel, (Non-ergodicity of Nosé-Hoover dynamics, Nonlinearity, 22 (2009), pp. 1673--1694) to prove that a "typical" singly thermostated, integrable, real-analytic hamiltonian possesses a positive-measure set of invariant tori when the thermostat is weakly coupled. It also demonstrates a class of integrable hamiltonians, which, for a full-measure set of couplings, satisfies the same conclusion.

Submitted 14 July, 2021; originally announced July 2021.

Comments: 32 pages; 7 figures

MSC Class: 70H08; 37J40; 82B05; 70F40

arXiv:2104.10130 [pdf, other]

Hidden Biases in Unreliable News Detection Datasets

Authors: Xiang Zhou, Heba Elfardy, Christos Christodoulopoulos, Thomas Butler, Mohit Bansal

Abstract: Automatic unreliable news detection is a research problem with great potential impact. Recently, several papers have shown promising results on large-scale news datasets with models that only use the article itself without resorting to any fact-checking mechanism or retrieving any supporting evidence. In this work, we take a closer look at these datasets. While they all provide valuable resources for future research, we observe a number of problems that may lead to results that do not generalize in more realistic settings. Specifically, we show that selection bias during data collection leads to undesired artifacts in the datasets. In addition, while most systems train and predict at the level of individual articles, overlapping article sources in the training and evaluation data can provide a strong confounding factor that models can exploit. In the presence of this confounding factor, the models can achieve good performance by directly memorizing the site-label mapping instead of modeling the real task of unreliable news detection. We observed a significant drop (>10%) in accuracy for all models tested in a clean split with no train/test source overlap. Using the observations and experimental results, we provide practical suggestions on how to create more reliable datasets for the unreliable news detection task. We suggest future dataset creation include a simple model as a difficulty/bias probe and future model development use a clean non-overlapping site and date split.

Submitted 20 April, 2021; originally announced April 2021.

Comments: EACL 2021 (11 pages, 3 figures, 8 tables)

arXiv:2101.05355 [pdf]

doi 10.1038/s41535-021-00315-8

Thickness dependent tuning of the Berry curvature in a ferromagnetic Weyl semimetal

Authors: Yao Zhang, Yuefeng Yin, Guy Dubuis, Tane Butler, Nikhil V. Medhekar, Simon Granville

Abstract: Magnetic Weyl semimetals with spontaneously broken time-reversal symmetry exhibit a large intrinsic anomalous Hall effect originating from the Berry curvature. To employ this large Hall current for room temperature topo-spintronics applications, it is necessary to fabricate these materials as thin or ultrathin films. Here, we experimentally demonstrate that Weyl semimetal Co2MnGa thin films (20-50 nm) show a very large anomalous Hall angle ~11.4% at low temperature and ~9.7% at room temperature, which can be ascribed to the nontrivial topology of the band structure with large intrinsic Berry curvature. However, the anomalous Hall angle decreases significantly with thicknesses below 20 nm, which band structure calculations confirm is due to the reduction of the majority spin contribution to the Berry curvature. Our results suggest that Co2MnGa is an excellent material to realize room temperature topo-spintronics applications; however, the significant thickness dependence of the Berry curvature has important implications for thin film device design.

Submitted 13 January, 2021; originally announced January 2021.

Journal ref: npj Quantum Mater. 6, 17 (2021)

arXiv:2011.04584 [pdf, other]

doi 10.1088/1361-648X/abea1c

Interpretable, calibrated neural networks for analysis and understanding of inelastic neutron scattering data

Authors: Keith T. Butler, Manh Duc Le, Jeyarajan Thiyagalingam, Toby G. Perring

Abstract: Deep neural networks provide flexible frameworks for learning data representations and functions relating data to other properties and are often claimed to achieve 'super-human' performance in inferring relationships between input data and desired property. In the context of inelastic neutron scattering experiments, however, as in many other scientific scenarios, a number of issues arise: (i) scarcity of labelled experimental data, (ii) lack of uncertainty quantification on results, and (iii) lack of interpretability of the deep neural networks. In this work we examine approaches to all three issues. We use simulated data to train a deep neural network to distinguish between two possible magnetic exchange models of a half-doped manganite. We apply the recently developed deterministic uncertainty quantification method to provide error estimates for the classification, demonstrating in the process how important realistic representations of instrument resolution in the training data are for reliable estimates on experimental data. Finally we use class activation maps to determine which regions of the spectra are most important for the final classification result reached by the network.

Submitted 20 November, 2020; v1 submitted 9 November, 2020; originally announced November 2020.

arXiv:2010.06423 [pdf]

A comprehensive protocol for manual segmentation of the human claustrum and its sub-regions using high-resolution MRI

Authors: Seung Suk Kang, Joseph Bodenheimer, Tracey Butler

Abstract: The claustrum (Cl) is a thin grey matter structure located in the center of each brain hemisphere. Cl has been hypothesized as a central hub of the brain for multisensory/sensorimotor integration, consciousness, and attention. Accumulating evidence has suggested that Cl might be important in the development of severe neurological and psychiatric symptoms including epileptic seizures and psychosis. However, the specifics of the roles of Cl in human epilepsy and psychosis are largely unknown, primarily due to methodological limitations related to the thin morphology of Cl that is challenging to delineate accurately using conventional methods. The goal of this work is to develop noninvasive multimodal neuroimaging methods to delineate Cl anatomy by utilizing a large healthy adult high resolution (0.7mm3) T1-weighted MRI collected as part of the Washington University-Minnesota Consortium Human Connectome Project (WU-Minn HCP). We developed a comprehensive manual segmentation protocol to delineate Cl based on a cellular level brain atlas. The protocol involves detailed guidelines to delineate the three subregions of Cl, including the dorsal, ventral, and temporal Cl that can be parcellated based on a geometric method. As demonstrated in a representative result, Cl is large in its anterior-posterior, and the dorsal-ventral extent. Also, the volume is comparable to that of the amygdala.
It is required to assess the reliability of the protocol so that it can be used for future anatomical studies of neuropsychiatric disorders, including epilepsy and schizophrenia.

Submitted 13 October, 2020; originally announced October 2020.

Comments: 15 pages, 6 figures

arXiv:2009.06918 [pdf, other]

Learning Quantities of Interest from Dynamical Systems for Observation-Consistent Inversion

Authors: Steven Mattis, Kyle Robert Steffen, Troy Butler, Clint N. Dawson, Donald Estep

Abstract: Dynamical systems arise in a wide variety of mathematical models from science and engineering. A common challenge is to quantify uncertainties on model inputs (parameters) that correspond to a quantitative characterization of uncertainties on observable Quantities of Interest (QoI). To this end, we consider a stochastic inverse problem (SIP) with a solution described by a pullback probability measure. We call this an observation-consistent solution, as its subsequent push-forward through the QoI map matches the observed probability distribution on model outputs. A distinction is made between QoI useful for solving the SIP and arbitrary model output data. In dynamical systems, model output data are often given as a series of state variable responses recorded over a particular time window. Consequently, the dimension of output data can easily exceed $\mathcal{O}(1E4)$ or more due to the frequency of observations, and the correct choice or construction of a QoI from this data is not self-evident. We present a new framework, Learning Uncertain Quantities (LUQ), that facilitates the tractable solution of SIPs for dynamical systems. Given ensembles of predicted (simulated) time series and (noisy) observed data, LUQ provides routines for filtering data, unsupervised learning of the underlying dynamics, classifying observations, and feature extraction to learn the QoI map. Subsequently, time series data are transformed into samples of the underlying predicted and observed distributions associated with the QoI so that solutions to the SIP are computable.
Following the introduction and demonstration of LUQ, numerical results from several SIPs are presented for a variety of dynamical systems arising in the life and physical sciences. For scientific reproducibility, we provide links to our Python implementation of LUQ and to all data and scripts required to reproduce the results in this manuscript.

Submitted 16 July, 2021; v1 submitted 15 September, 2020; originally announced September 2020.

Comments: 38 pages, 14 figures. Submitted to Computer Methods in Applied Mechanics and Engineering

arXiv:2005.05831 [pdf, other]

doi 10.1063/5.0013136

Modelling the dielectric constants of crystals using machine learning

Authors: Kazuki Morita, Daniel W. Davies, Keith T. Butler, Aron Walsh

Abstract: The relative permittivity of a crystal is a fundamental property that links microscopic chemical bonding to macroscopic electromagnetic response. Multiple models, including analytical, numerical and statistical descriptions, have been made to understand and predict dielectric behaviour. Analytical models are often limited to a particular type of compounds, whereas machine learning (ML) models often lack interpretability. Here, we combine supervised ML, density functional perturbation theory, and analysis based on game theory to predict and explain the physical trends in optical dielectric constants of crystals. Two ML models, support vector regression and deep neural networks, were trained on a dataset of 1,364 dielectric constants. Shapley additive explanations (SHAP) analysis of the ML models reveals that they recover correlations described by textbook Clausius-Mossotti and Penn models, which gives confidence in their ability to describe physical behavior, while providing superior predictive power.

Submitted 12 May, 2020; originally announced May 2020.

Comments: 14 pages, 4 figures, 4 tables

arXiv:2005.03740 [pdf, other]

doi 10.1088/1361-6382/abac46

Horseshoes and invariant tori in cosmological models with a coupled field and non-zero curvature

Authors: Leo T. Butler

Abstract: This paper studies the dynamics of a family of hamiltonian systems that originate from Friedman-Lemaître-Robertson-Walker space-times with a coupled field and non-zero curvature. In four distinct cases, previously considered by Maciejewski, Przybylska, Stachowiak & Szydowski, it is shown that there are homoclinic connections to invariant submanifolds and the connections split. These results imply the non-existence of a real-analytic integral independent of the hamiltonian.

Submitted 7 May, 2020; originally announced May 2020.

Comments: 21 pages, 4 figures

MSC Class: 37J30; 37J35; 70F07; 70F08

arXiv:2001.04369 [pdf, other]

Convergence of Probability Densities using Approximate Models for Forward and Inverse Problems in Uncertainty Quantification: Extensions to $L^p$

Authors: Troy Butler, Tim Wildey, Wenjuan Zhang

Abstract: A previous study analyzed the convergence of probability densities for forward and inverse problems when a sequence of approximate maps between model inputs and outputs converges in $L^\infty$. This work generalizes the analysis to cases where the approximate maps converge in $L^p$ for any $1\leq p < \infty$. Specifically, under the assumption that the approximate maps converge in $L^p$, the convergence of probability density functions solving either forward or inverse problems is proven in $L^q$ where the value of $1\leq q<\infty$ may even be greater than $p$ in certain cases. This greatly expands the applicability of the previous results to commonly used methods for approximating models (such as polynomial chaos expansions) that only guarantee $L^p$ convergence for some $1\leq p<\infty$. Several numerical examples are also included along with numerical diagnostics of solutions and verification of assumptions made in the analysis.

Submitted 13 January, 2020; originally announced January 2020.

arXiv:1909.02995 [pdf, other]

Horseshoes for singly thermostated hamiltonians

Authors: Leo T. Butler

Abstract: This note studies 1 and 2 degree of freedom hamiltonian systems that are thermostated by a single-variable thermostat. Under certain conditions on the hamiltonian and thermostat, the existence of a horseshoe in the flow of the thermostated system is proven.

Submitted 6 September, 2019; originally announced September 2019.

Comments: 16 pages; 1 figure

MSC Class: 37J30; 53C17; 53C30; 53D25

arXiv:1909.02153 [pdf]

doi 10.1088/2053-1591/ab3bd3

Exploring Disorder in the Spin Gapless Semiconductor Mn$_2$CoAl

Authors: Robert G. Buckley, Tane Butler, Catherine Pot, Nicholas M. Strickland, Simon Granville

Abstract: Since the prediction of spin-gapless semiconducting behaviour in the Heusler compound Mn$_2$CoAl, evidence of spin-gapless behaviour in thin films has typically been inferred from magnetotransport measurements. The spin gapless state is however fragile, and further, band structure calculations indicate that even a small amount of atomic disorder may destroy it. To explore the impact of disorder on the properties of Mn$_2$CoAl, we have undertaken an experimental study of the structural, magnetotransport and optical properties from the far infrared to the UV, on DC magnetron sputtered Mn$_2$CoAl thin films. A very short mean free path, of the order of a lattice spacing, is extracted from the DC transport data. A room temperature resistivity of 200 $μ$$Ω$cm along with a small and negative temperature coefficient of resistance between 4 and 400 K was measured. We note that parameters of this magnitude are often observed in disordered metals. We find this behaviour is well described by a weak localisation model, a result that is supported by a large Drude contribution to the optical response, where a high scattering rate is derived, which is equal to the value derived from the DC conductivity and Hall effect data. We also note the strong similarities between the magnetotransport behaviour reported for Mn$_2$CoAl films in the literature, including ours.
We conclude that, based on comparisons between the experimental data, and recent band structure calculations that explicitly include disorder, as-prepared Mn$_2$CoAl films are best described as a disordered metal, rather than a spin gapless semiconductor.

Submitted 4 September, 2019; originally announced September 2019.

Comments: 16 pages, 7 figures

Journal ref: Materials Research Express 6, 106113 (2019)

arXiv:1908.08070 [pdf, other]

doi 10.1103/PhysRevApplied.15.054030

Quantum-statistical transport phenomena in memristive computing architectures

Authors: Christopher N. Singh, Brian A. Crafton, Mathew P. West, Alex S. Weidenbach, Keith T. Butler, Allan H. MacDonald, Arjit Raychowdury, Eric M. Vogel, W. Alan Doolittle, L. F. J. Piper, Wei-Cheng Lee

Abstract: The advent of reliable, nanoscale memristive components is promising for next generation compute-in-memory paradigms, however, the intrinsic variability in these devices has prevented widespread adoption. Here we show coherent electron wave functions play a pivotal role in the nanoscale transport properties of these emerging, non-volatile memories. By characterizing both filamentary and non-filamentary memristive devices as disordered Anderson systems, the switching characteristics and intrinsic variability arise directly from the universality of electron transport in disordered media. Our framework suggests localization phenomena in nanoscale, solid-state memristive systems are directly linked to circuit level performance. We discuss how quantum conductance fluctuations in the active layer set a lower bound on device variability. This finding implies there is a fundamental quantum limit on the reliability of memristive devices, and electron coherence will play a decisive role in surpassing or maintaining Moore's Law with these systems.

Submitted 31 May, 2021; v1 submitted 21 August, 2019; originally announced August 2019.

Comments: 13 pages, 6 figures

Journal ref: Phys. Rev. Applied 15, 054030 (2021)

arXiv:1907.06324 [pdf, other]

doi 10.1039/C9SC03378E

Metal-free perovskites for non-linear optical materials

Authors: Thomas W. Kasel, Zeyu Deng, Austin M. Mroz, Christopher H. Hendon, Keith T. Butler, Pieremanuele Canepa

Abstract: We identify the existence of nonlinear optical (NLO) activity in a number of novel $ABX_3$-type metal-free perovskites, where $A$ is a highly tuneable organic cation, $B$ is a NH$_4$ cation and $X$ a halide anion. Through systematic first-principles calculations, we identify important trends to chart the second-harmonic generation of this class of materials. We study three perovskites MDABCO-NH$_4$I$_3$, CNDABCO-NH$_4$I$_3$ and ODABCO-NH$_4$I$_3$ for use as deep-UV second-harmonic generation materials. We identify the role of the dipole moment imparted by the organic group on the $A$ cation as an important parameter to tune the NLO properties of these materials. We apply this knowledge functionalising the organic group DABCO with the highly polar cyanide CN$^-$ group, and we demonstrate a significant improvement of the NLO response in this family of materials. These findings can accelerate the application of metal free perovskites as inexpensive, non-toxic, earth-abundant materials for the next generation of optical communication applications.

Submitted 14 July, 2019; originally announced July 2019.

Comments: 16 pages, 4 figures

Journal ref: Chem. Sci. (2019)

arXiv:1810.13219 [pdf, other]

doi 10.1063/1.5079485

Finding a junction partner for candidate solar cell absorbers enargite and bournonite from electronic band and lattice matching

Authors: Suzanne K. Wallace, Keith T. Butler, Yoyo Hinuma, Aron Walsh

Abstract: An essential step in the development of a new photovoltaic (PV) technology is choosing appropriate electron and hole extraction layers to make an efficient device. We recently proposed the minerals enargite (Cu3AsS4) and bournonite (CuPbSbS3) as materials that are chemically stable with desirable optoelectronic properties for use as the absorber layer in a thin-film PV device. For these compounds, spontaneous lattice polarization with internal electric fields --- and potential ferroelectricity --- may allow for enhanced carrier separation and novel photophysical effects. In this work, we calculate the ionization potentials for non-polar surface terminations and propose suitable partners for forming solar cell heterojunctions by matching the electronic band edges to a set of candidate electrical contact materials. We then further screen these candidates by matching the lattice constants and identify those that are likely to minimise strain and achieve epitaxy. This two-step screening procedure identified a range of unconventional candidate contact materials including SnS2, ZnTe, WO3, and Bi2O3.

Submitted 31 October, 2018; originally announced October 2018.

Comments: 8 pages, 4 figures, 3 tables

Journal ref: Journal of Applied Physics 125, 055703 (2019)

arXiv:1808.00359 [pdf, other]

doi 10.1088/2515-7655/aad928

Quick-start guide for first-principles modelling of semiconductor interfaces

Authors: Ji-Sang Park, Young-Kwang Jung, Keith T. Butler, Aron Walsh

Abstract: Interfaces between dissimilar materials control the transport of energy in a range of technologies including solar cells (electron transport), batteries (ion transport), and thermoelectrics (heat transport). Advances in computer power and algorithms mean that first-principles models of interfacial processes in realistic systems are now possible using accurate approaches such as density functional theory. In this 'quick-start guide', we discuss the best practice in how to construct atomic models between two materials and analysis techniques appropriate to probe changes in local bonding and electronic band offsets. A number of examples are given related to perovskite solar cells.

Submitted 1 August, 2018; originally announced August 2018.

Journal ref: J Phys Energy 1, 016001 (2019)

arXiv:1807.00375 [pdf, other]

doi 10.1137/18M1181675

Convergence of Probability Densities using Approximate Models for Forward and Inverse Problems in Uncertainty Quantification

Authors: T. Butler, J. D. Jakeman, T. Wildey

Abstract: We analyze the convergence of probability density functions utilizing approximate models for both forward and inverse problems. We consider the standard forward uncertainty quantification problem where an assumed probability density on parameters is propagated through the approximate model to produce a probability density, often called a push-forward probability density, on a set of quantities of interest (QoI). The inverse problem considered in this paper seeks a posterior probability density on model input parameters such that the subsequent push-forward density through the parameter-to-QoI map matches a given probability density on the QoI. We prove that the probability densities obtained from solving the forward and inverse problems, using approximate models, converge to the true probability densities as the approximate models converge to the true models. Numerical results are presented to demonstrate optimal convergence of probability densities for sparse grid approximations of parameter-to-QoI maps and standard spatial and temporal discretizations of PDEs and ODEs.

Submitted 1 July, 2018; originally announced July 2018.

MSC Class: 60H30; 60H35; 60B10

arXiv:1806.10198 [pdf, other]

Invariant tori for a class of singly thermostated hamiltonians

Authors: Leo T. Butler

Abstract: This paper demonstrates sufficient conditions for the existence of a positive measure set of invariant KAM tori in a singly thermostated, 1 degree-of-freedom hamiltonian vector field. This result is applied to 4 important single thermostats in the literature and it is shown that in each case, if the hamiltonian is real-analytic and well-behaved, then the thermostated system always has a positive measure set of invariant KAM tori for sufficiently weak coupling and high temperature. This extends results of Legoll, Luskin & Moeckel.

Submitted 3 August, 2019; v1 submitted 26 June, 2018; originally announced June 2018.

Comments: 27 pages, 8 figures

MSC Class: 37J30; 53C17; 53C30; 53D25

arXiv:1803.07970 [pdf]

doi 10.1021/acsami.8b01729

Band Engineering of Carbon Nitride Monolayers by N-type, P-type, and Isoelectronic Doping for Photocatalytic Applications

Authors: Meysam Makaremi, Sean Grixti, Keith T. Butler, Geoffrey A. Ozin, Chandra Veer Singh

Abstract: Since hydrogen fuel involves the highest energy density among all fuels, production of this gas through the solar water splitting approach has been suggested as a green remedy for greenhouse environmental issues due to extensive consumption of fossil fuels. Low dimensional materials possessing a large surface-to-volume ratio can be a promising candidate to be used for the photocatalytic approach. Here, we used extensive first principles calculations to investigate the application of newly fabricated members of two dimensional carbon nitrides including tg-C3N4, hg-C3N4, C2N, and C3N for water splitting. Band engineering via n-type, p-type, and isoelectronic doping agents such as B, N, P, Si, and Ge was demonstrated for tuning the electronic structure; optimizing solar absorption and band alignment for photocatalysis. Pristine tg-C3N4, hg-C3N4, and C2N crystals involve bandgaps of 3.190 eV, 2.772 eV, and 2.465 eV, respectively, which are not proper for water splitting. Among the dopants, Si and Ge dopants can narrow the band gap of carbon nitrides about 0.5 - 1.0 eV, and also increase their optical absorption in the visible spectrum. This study presents the potential for doping with isoelectronic elements to greatly improve the photocatalytic characteristics of carbon nitride nanostructures.

Submitted 21 March, 2018; originally announced March 2018.

Journal ref: ACS Appl. Mater. Interfaces 2018

arXiv:1710.01290 [pdf, ps, other]

doi 10.1007/s00222-004-0380-5

Invariant fibration of geodesic flows

Authors: Leo T. Butler

Abstract: Let (Σ, g) be a compact $C^2$ finslerian 3-manifold. If the geodesic flow of g is completely integrable, and the singular set is a tamely-embedded polyhedron, then $π_1(Σ)$ is almost polycyclic. On the other hand, if Σ is a compact, irreducible 3-manifold and $π_1(Σ)$ is infinite polycyclic while $π_2(Σ)$ is trivial, then Σ admits an analytic riemannian metric whose geodesic flow is completely integrable and singular set is a real-analytic variety. Additional results in higher dimensions are proven.

Submitted 3 October, 2017; originally announced October 2017.

Comments: 28 pages. Published in 2005

MSC Class: 37J30; 37K10; 53C60; 53C22; 53D25

Journal ref: Topology. 4:44 (2005) pps. 769--789

Showing 1–50 of 81 results for author: Butler, T