Skip to main content

Showing 1–26 of 26 results for author: Morgan, D

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.09977  [pdf, other

    cs.CE

    Physical regularized Hierarchical Generative Model for Metallic Glass Structural Generation and Energy Prediction

    Authors: Qiyuan Chen, Ajay Annamareddy, Ying-Fei Li, Dane Morgan, Bu Wang

    Abstract: Disordered materials such as glasses, unlike crystals, lack long range atomic order and have no periodic unit cells, yielding a high dimensional configuration space with widely varying properties. The complexity not only increases computational costs for atomistic simulations but also makes it difficult for generative AI models to deliver accurate property predictions and realistic structure gener… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

  2. arXiv:2503.12326  [pdf, other

    cs.CV cond-mat.mtrl-sci cs.AI

    Leveraging Vision Capabilities of Multimodal LLMs for Automated Data Extraction from Plots

    Authors: Maciej P. Polak, Dane Morgan

    Abstract: Automated data extraction from research texts has been steadily improving, with the emergence of large language models (LLMs) accelerating progress even further. Extracting data from plots in research papers, however, has been such a complex task that it has predominantly been confined to manual data extraction. We show that current multimodal large language models, with proper instructions and en… ▽ More

    Submitted 15 March, 2025; originally announced March 2025.

    Comments: 8 pages, 3 figures

  3. arXiv:2503.09814  [pdf

    cond-mat.mtrl-sci cs.LG

    A practical guide to machine learning interatomic potentials -- Status and future

    Authors: Ryan Jacobs, Dane Morgan, Siamak Attarian, Jun Meng, Chen Shen, Zhenghao Wu, Clare Yijia Xie, Julia H. Yang, Nongnuch Artrith, Ben Blaiszik, Gerbrand Ceder, Kamal Choudhary, Gabor Csanyi, Ekin Dogus Cubuk, Bowen Deng, Ralf Drautz, Xiang Fu, Jonathan Godwin, Vasant Honavar, Olexandr Isayev, Anders Johansson, Boris Kozinsky, Stefano Martiniani, Shyue Ping Ong, Igor Poltavsky , et al. (5 additional authors not shown)

    Abstract: The rapid development and large body of literature on machine learning interatomic potentials (MLIPs) can make it difficult to know how to proceed for researchers who are not experts but wish to use these tools. The spirit of this review is to help such researchers by serving as a practical, accessible guide to the state-of-the-art in MLIPs. This review paper covers a broad range of topics related… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Journal ref: Current Opinion in Solid State and Materials Science, 35, 101214 (2025)

  4. arXiv:2501.08465  [pdf, other

    cs.CV cond-mat.mtrl-sci

    Predicting Performance of Object Detection Models in Electron Microscopy Using Random Forests

    Authors: Ni Li, Ryan Jacobs, Matthew Lynch, Vidit Agrawal, Kevin Field, Dane Morgan

    Abstract: Quantifying prediction uncertainty when applying object detection models to new, unlabeled datasets is critical in applied machine learning. This study introduces an approach to estimate the performance of deep learning-based object detection models for quantifying defects in transmission electron microscopy (TEM) images, focusing on detecting irradiation-induced cavities in TEM images of metal al… ▽ More

    Submitted 14 January, 2025; originally announced January 2025.

    Comments: 14 pages, 9 figures, 3 tables

  5. arXiv:2412.10849  [pdf

    cs.AI cs.CL

    Superhuman performance of a large language model on the reasoning tasks of a physician

    Authors: Peter G. Brodeur, Thomas A. Buckley, Zahir Kanjee, Ethan Goh, Evelyn Bin Ling, Priyank Jain, Stephanie Cabral, Raja-Elie Abdulnour, Adrian D. Haimovich, Jason A. Freed, Andrew Olson, Daniel J. Morgan, Jason Hom, Robert Gallo, Liam G. McCoy, Haadi Mombini, Christopher Lucas, Misha Fotoohi, Matthew Gwiazdon, Daniele Restifo, Daniel Restrepo, Eric Horvitz, Jonathan Chen, Arjun K. Manrai, Adam Rodman

    Abstract: A seminal paper published by Ledley and Lusted in 1959 introduced complex clinical diagnostic reasoning cases as the gold standard for the evaluation of expert medical computing systems, a standard that has held ever since. Here, we report the results of a physician evaluation of a large language model (LLM) on challenging clinical cases against a baseline of hundreds of physicians. We conduct fiv… ▽ More

    Submitted 2 June, 2025; v1 submitted 14 December, 2024; originally announced December 2024.

  6. arXiv:2409.16380  [pdf, other

    cs.CV cs.LG

    Development and Application of a Sentinel-2 Satellite Imagery Dataset for Deep-Learning Driven Forest Wildfire Detection

    Authors: Valeria Martin, K. Brent Venable, Derek Morgan

    Abstract: Forest loss due to natural events, such as wildfires, represents an increasing global challenge that demands advanced analytical methods for effective detection and mitigation. To this end, the integration of satellite imagery with deep learning (DL) methods has become essential. Nevertheless, this approach requires substantial amounts of labeled data to produce accurate results. In this study, we… ▽ More

    Submitted 24 September, 2024; originally announced September 2024.

  7. arXiv:2409.06756  [pdf

    cs.LG cond-mat.mtrl-sci cs.AI

    Beyond designer's knowledge: Generating materials design hypotheses via large language models

    Authors: Quanliang Liu, Maciej P. Polak, So Yeon Kim, MD Al Amin Shuvo, Hrishikesh Shridhar Deodhar, Jeongsoo Han, Dane Morgan, Hyunseok Oh

    Abstract: Materials design often relies on human-generated hypotheses, a process inherently limited by cognitive constraints such as knowledge gaps and limited ability to integrate and extract knowledge implications, particularly when multidisciplinary expertise is required. This work demonstrates that large language models (LLMs), coupled with prompt engineering, can effectively generate non-trivial materi… ▽ More

    Submitted 10 September, 2024; originally announced September 2024.

  8. arXiv:2409.06080  [pdf

    cond-mat.mtrl-sci cs.LG

    Regression with Large Language Models for Materials and Molecular Property Prediction

    Authors: Ryan Jacobs, Maciej P. Polak, Lane E. Schultz, Hamed Mahdavi, Vasant Honavar, Dane Morgan

    Abstract: We demonstrate the ability of large language models (LLMs) to perform material and molecular property regression tasks, a significant deviation from the conventional LLM use case. We benchmark the Large Language Model Meta AI (LLaMA) 3 on several molecular properties in the QM9 dataset and 24 materials properties. Only composition-based input strings are used as the model input and we fine tune on… ▽ More

    Submitted 9 September, 2024; originally announced September 2024.

  9. arXiv:2408.01558  [pdf

    cs.CV cond-mat.mtrl-sci

    Accelerating Domain-Aware Electron Microscopy Analysis Using Deep Learning Models with Synthetic Data and Image-Wide Confidence Scoring

    Authors: Matthew J. Lynch, Ryan Jacobs, Gabriella Bruno, Priyam Patki, Dane Morgan, Kevin G. Field

    Abstract: The integration of machine learning (ML) models enhances the efficiency, affordability, and reliability of feature detection in microscopy, yet their development and applicability are hindered by the dependency on scarce and often flawed manually labeled datasets and a lack of domain awareness. We addressed these challenges by creating a physics-based synthetic image and data generator, resulting… ▽ More

    Submitted 2 August, 2024; originally announced August 2024.

  10. arXiv:2406.05143  [pdf, other

    cond-mat.mtrl-sci cond-mat.other cs.LG

    A General Approach for Determining Applicability Domain of Machine Learning Models

    Authors: Lane E. Schultz, Yiqi Wang, Ryan Jacobs, Dane Morgan

    Abstract: Knowledge of the domain of applicability of a machine learning model is essential to ensuring accurate and reliable model predictions. In this work, we develop a new and general approach of assessing model domain and demonstrate that our approach provides accurate and meaningful domain designation across multiple model types and material property data sets. Our approach assesses the distance betwe… ▽ More

    Submitted 22 March, 2025; v1 submitted 28 May, 2024; originally announced June 2024.

  11. arXiv:2404.09896  [pdf

    cs.LG cond-mat.mtrl-sci

    Accelerating Ensemble Error Bar Prediction with Single Models Fits

    Authors: Vidit Agrawal, Shixin Zhang, Lane E. Schultz, Dane Morgan

    Abstract: Ensemble models can be used to estimate prediction uncertainties in machine learning models. However, an ensemble of N models is approximately N times more computationally demanding compared to a single model when it is used for inference. In this work, we explore fitting a single model to predicted ensemble error bar data, which allows us to estimate uncertainties without the need for a full ense… ▽ More

    Submitted 15 April, 2024; originally announced April 2024.

    Comments: 14 pages, 4 figures, 1 table

  12. arXiv:2403.14712  [pdf, other

    cs.CY cs.AI

    AI for bureaucratic productivity: Measuring the potential of AI to help automate 143 million UK government transactions

    Authors: Vincent J. Straub, Youmna Hashem, Jonathan Bright, Satyam Bhagwanani, Deborah Morgan, John Francis, Saba Esnaashari, Helen Margetts

    Abstract: There is currently considerable excitement within government about the potential of artificial intelligence to improve public service productivity through the automation of complex but repetitive bureaucratic tasks, freeing up the time of skilled staff. Here, we explore the size of this opportunity, by mapping out the scale of citizen-facing bureaucratic decision-making procedures within UK centra… ▽ More

    Submitted 18 March, 2024; originally announced March 2024.

  13. arXiv:2401.01291  [pdf, other

    cs.CY

    Generative AI is already widespread in the public sector

    Authors: Jonathan Bright, Florence E. Enock, Saba Esnaashari, John Francis, Youmna Hashem, Deborah Morgan

    Abstract: Generative AI has the potential to transform how public services are delivered by enhancing productivity and reducing time spent on bureaucracy. Furthermore, unlike other types of artificial intelligence, it is a technology that has quickly become widely available for bottom-up adoption: essentially anyone can decide to make use of it in their day to day work. But to what extent is generative AI a… ▽ More

    Submitted 2 January, 2024; originally announced January 2024.

  14. arXiv:2310.06475  [pdf

    cs.CY

    Approaches to the Algorithmic Allocation of Public Resources: A Cross-disciplinary Review

    Authors: Saba Esnaashari, Jonathan Bright, John Francis, Youmna Hashem, Vincent Straub, Deborah Morgan

    Abstract: Allocation of scarce resources is a recurring challenge for the public sector: something that emerges in areas as diverse as healthcare, disaster recovery, and social welfare. The complexity of these policy domains and the need for meeting multiple and sometimes conflicting criteria has led to increased focus on the use of algorithms in this type of decision. However, little engagement between res… ▽ More

    Submitted 10 October, 2023; originally announced October 2023.

  15. arXiv:2303.14007  [pdf

    cs.CY cs.AI cs.HC

    'Team-in-the-loop': Ostrom's IAD framework 'rules in use' to map and measure contextual impacts of AI

    Authors: Deborah Morgan, Youmna Hashem, John Francis, Saba Esnaashari, Vincent J. Straub, Jonathan Bright

    Abstract: This article explores how the 'rules in use' from Ostrom's Institutional Analysis and Development Framework (IAD) can be developed as a context analysis approach for AI. AI risk assessment frameworks increasingly highlight the need to understand existing contexts. However, these approaches do not frequently connect with established institutional analysis scholarship. We outline a novel direction i… ▽ More

    Submitted 30 June, 2024; v1 submitted 24 March, 2023; originally announced March 2023.

    Comments: 19 pages

  16. arXiv:2303.10106  [pdf

    cs.CY cs.AI

    A multidomain relational framework to guide institutional AI research and adoption

    Authors: Vincent J. Straub, Deborah Morgan, Youmna Hashem, John Francis, Saba Esnaashari, Jonathan Bright

    Abstract: Calls for new metrics, technical standards and governance mechanisms to guide the adoption of Artificial Intelligence (AI) in institutions and public administration are now commonplace. Yet, most research and policy efforts aimed at understanding the implications of adopting AI tend to prioritize only a handful of ideas; they do not fully connect all the different perspectives and topics that are… ▽ More

    Submitted 17 July, 2023; v1 submitted 17 March, 2023; originally announced March 2023.

    Comments: 23 pages, 1 figure

  17. arXiv:2303.05352  [pdf, other

    cs.CL cond-mat.mtrl-sci

    Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering

    Authors: Maciej P. Polak, Dane Morgan

    Abstract: There has been a growing effort to replace manual extraction of data from research papers with automated data extraction based on natural language processing, language models, and recently, large language models (LLMs). Although these methods enable efficient extraction of data from large sets of research papers, they require a significant amount of up-front effort, expertise, and coding. In this… ▽ More

    Submitted 21 February, 2024; v1 submitted 7 March, 2023; originally announced March 2023.

    Comments: 7 pages, 3 figures, 1 table

    Journal ref: Nature Communications (2024) 15:1569

  18. arXiv:2302.04914  [pdf, other

    cond-mat.mtrl-sci cs.AI cs.CL

    Flexible, Model-Agnostic Method for Materials Data Extraction from Text Using General Purpose Language Models

    Authors: Maciej P. Polak, Shrey Modi, Anna Latosinska, Jinming Zhang, Ching-Wen Wang, Shaonan Wang, Ayan Deep Hazra, Dane Morgan

    Abstract: Accurate and comprehensive material databases extracted from research papers are crucial for materials science and engineering, but their development requires significant human effort. With large language models (LLMs) transforming the way humans interact with text, LLMs provide an opportunity to revolutionize data extraction. In this study, we demonstrate a simple and efficient method for extract… ▽ More

    Submitted 12 June, 2024; v1 submitted 9 February, 2023; originally announced February 2023.

    Comments: 13 pages, 4 figures

    Journal ref: Digital Discovery, 2024, 3, 1221-1235

  19. arXiv:2211.08194  [pdf

    cond-mat.mtrl-sci cs.CV cs.LG

    Machine learning for classifying and interpreting coherent X-ray speckle patterns

    Authors: Mingren Shen, Dina Sheyfer, Troy David Loeffler, Subramanian K. R. S. Sankaranarayanan, G. Brian Stephenson, Maria K. Y. Chan, Dane Morgan

    Abstract: Speckle patterns produced by coherent X-ray have a close relationship with the internal structure of materials but quantitative inversion of the relationship to determine structure from speckle patterns is challenging. Here, we investigate the link between coherent X-ray speckle patterns and sample structures using a model 2D disk system and explore the ability of machine learning to learn aspects… ▽ More

    Submitted 1 September, 2023; v1 submitted 15 November, 2022; originally announced November 2022.

  20. arXiv:2210.17218  [pdf

    cs.CY cs.AI cs.HC cs.LG eess.SY

    Artificial intelligence in government: Concepts, standards, and a unified framework

    Authors: Vincent J. Straub, Deborah Morgan, Jonathan Bright, Helen Margetts

    Abstract: Recent advances in artificial intelligence (AI), especially in generative language modelling, hold the promise of transforming government. Given the advanced capabilities of new AI systems, it is critical that these are embedded using standard operational procedures, clear epistemic criteria, and behave in alignment with the normative expectations of society. Scholars in multiple domains have subs… ▽ More

    Submitted 25 October, 2023; v1 submitted 31 October, 2022; originally announced October 2022.

    Comments: 35 pages with references and appendix, 3 tables, 2 figures

  21. arXiv:2110.08244  [pdf

    cs.CV cond-mat.mtrl-sci

    Performance, Successes and Limitations of Deep Learning Semantic Segmentation of Multiple Defects in Transmission Electron Micrographs

    Authors: Ryan Jacobs, Mingren Shen, Yuhan Liu, Wei Hao, Xiaoshan Li, Ruoyu He, Jacob RC Greaves, Donglin Wang, Zeming Xie, Zitong Huang, Chao Wang, Kevin G. Field, Dane Morgan

    Abstract: In this work, we perform semantic segmentation of multiple defect types in electron microscopy images of irradiated FeCrAl alloys using a deep learning Mask Regional Convolutional Neural Network (Mask R-CNN) model. We conduct an in-depth analysis of key model performance statistics, with a focus on quantities such as predicted distributions of defect shapes, defect sizes, and defect areal densitie… ▽ More

    Submitted 15 October, 2021; originally announced October 2021.

  22. arXiv:2108.08883  [pdf

    cs.CV cond-mat.mtrl-sci

    Multi defect detection and analysis of electron microscopy images with deep learning

    Authors: Mingren Shen, Guanzhao Li, Dongxia Wu, Yuhan Liu, Jacob Greaves, Wei Hao, Nathaniel J. Krakauer, Leah Krudy, Jacob Perez, Varun Sreenivasan, Bryan Sanchez, Oigimer Torres, Wei Li, Kevin Field, Dane Morgan

    Abstract: Electron microscopy is widely used to explore defects in crystal structures, but human detecting of defects is often time-consuming, error-prone, and unreliable, and is not scalable to large numbers of images or real-time analysis. In this work, we discuss the application of machine learning approaches to find the location and geometry of different defect clusters in irradiated steels. We show tha… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  23. arXiv:2108.08882  [pdf

    cs.CV cond-mat.mtrl-sci

    A Deep Learning Based Automatic Defect Analysis Framework for In-situ TEM Ion Irradiations

    Authors: Mingren Shen, Guanzhao Li, Dongxia Wu, Yudai Yaguchi, Jack C. Haley, Kevin G. Field, Dane Morgan

    Abstract: Videos captured using Transmission Electron Microscopy (TEM) can encode details regarding the morphological and temporal evolution of a material by taking snapshots of the microstructure sequentially. However, manual analysis of such video is tedious, error-prone, unreliable, and prohibitively time-consuming if one wishes to analyze a significant fraction of frames for even videos of modest length… ▽ More

    Submitted 19 August, 2021; originally announced August 2021.

  24. arXiv:2010.15315  [pdf

    cs.CV cond-mat.mtrl-sci cs.LG eess.IV

    Exploring Generative Adversarial Networks for Image-to-Image Translation in STEM Simulation

    Authors: Nick Lawrence, Mingren Shen, Ruiqi Yin, Cloris Feng, Dane Morgan

    Abstract: The use of accurate scanning transmission electron microscopy (STEM) image simulation methods require large computation times that can make their use infeasible for the simulation of many images. Other simulation methods based on linear imaging models, such as the convolution method, are much faster but are too inaccurate to be used in application. In this paper, we explore deep learning models th… ▽ More

    Submitted 16 November, 2022; v1 submitted 28 October, 2020; originally announced October 2020.

  25. arXiv:2004.12935  [pdf, other

    cs.CL

    Natural language processing for achieving sustainable development: the case of neural labelling to enhance community profiling

    Authors: Costanza Conforti, Stephanie Hirmer, David Morgan, Marco Basaldella, Yau Ben Or

    Abstract: In recent years, there has been an increasing interest in the application of Artificial Intelligence - and especially Machine Learning - to the field of Sustainable Development (SD). However, until now, NLP has not been applied in this context. In this research paper, we show the high potential of NLP applications to enhance the sustainability of projects. In particular, we focus on the case of co… ▽ More

    Submitted 17 November, 2020; v1 submitted 27 April, 2020; originally announced April 2020.

    Comments: 18 pages, 9 figures. Accepted at EMNLP 2020

  26. arXiv:2002.11315  [pdf

    physics.comp-ph cond-mat.mtrl-sci cs.LG

    Assessing Graph-based Deep Learning Models for Predicting Flash Point

    Authors: Xiaoyu Sun, Nathaniel J. Krakauer, Alexander Politowicz, Wei-Ting Chen, Qiying Li, Zuoyi Li, Xianjia Shao, Alfred Sunaryo, Mingren Shen, James Wang, Dane Morgan

    Abstract: Flash points of organic molecules play an important role in preventing flammability hazards and large databases of measured values exist, although millions of compounds remain unmeasured. To rapidly extend existing data to new compounds many researchers have used quantitative structure-property relationship (QSPR) analysis to effectively predict flash points. In recent years graph-based deep learn… ▽ More

    Submitted 26 February, 2020; originally announced February 2020.

    Comments: 26 pages, 6 tabels, 3 figures

    Journal ref: Mol. Inf. 2020, 39, 1900101