-
Stretched Non-negative Matrix Factorization
Authors:
Ran Gu,
Yevgeny Rakita,
Ling Lan,
Zach Thatcher,
Gabrielle E. Kamm,
Daniel O'Nolan,
Brennan McBride,
Allison Wustrow,
James R. Neilson,
Karena W. Chapman,
Qiang Du,
Simon J. L. Billinge
Abstract:
An algorithm is described and tested that carries out a non-negative matrix factorization (NMF) while ignoring any stretching of the signal along the axis of the independent variable. This extended NMF model is called StretchedNMF. Variability in a set of signals due to this stretching is then ignored in the decomposition. This can be used, for example, to study sets of powder diffraction data collected at different temperatures where the materials are undergoing thermal expansion. In this case it gives a more meaningful decomposition, in which the component signals resemble signals from chemical components in the sample. The StretchedNMF model introduces a new variable, the stretching factor, to describe any expansion of the signal. To solve StretchedNMF, we discretize it and employ algorithms from the Block Coordinate Descent framework. Initial experimental results indicate that the StretchedNMF model outperforms conventional NMF for sets of data with such an expansion. A further enhancement for powder diffraction data from crystalline materials, called Sparse-StretchedNMF, makes use of the sparsity of the powder diffraction signals and allows correct extractions even for very small stretches where StretchedNMF struggles. The model performance is demonstrated on simulated powder X-ray diffraction (PXRD) patterns and atomic pair distribution functions (PDFs), and the model also proved successful when applied to real data taken from an in situ chemical reaction experiment.
Submitted 25 November, 2023;
originally announced November 2023.
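The stretched model described in the abstract above can be illustrated with a minimal forward-model sketch. It is not the authors' implementation: the function names, the array layout, the use of linear interpolation, and the convention that a stretching factor a maps a component s(r) to s(r/a) are all assumptions made for illustration. Only the forward model is shown, not the Block Coordinate Descent solver.

```python
import numpy as np

def stretch(signal, grid, factor):
    """Evaluate signal(grid / factor) by linear interpolation (assumed convention).

    Points that fall beyond the end of the grid are held at the last sample.
    """
    return np.interp(grid / factor, grid, signal)

def stretched_model(components, weights, stretches, grid):
    """Build M signals from K components, each with its own weight and stretch.

    components: (K, N) non-negative component signals on a common grid
    weights:    (K, M) non-negative weights
    stretches:  (K, M) stretching factors (1.0 means no stretch)
    returns:    (M, N) reconstructed signals
    """
    n_components, n_signals = weights.shape
    out = np.zeros((n_signals, grid.size))
    for m in range(n_signals):
        for k in range(n_components):
            out[m] += weights[k, m] * stretch(components[k], grid, stretches[k, m])
    return out

# Two hypothetical Gaussian "phases" on a common grid.
grid = np.linspace(0.0, 10.0, 501)
components = np.vstack([
    np.exp(-0.5 * ((grid - 3.0) / 0.1) ** 2),
    np.exp(-0.5 * ((grid - 6.0) / 0.1) ** 2),
])
weights = np.array([[1.0, 0.5],
                    [0.0, 0.5]])

# With all stretches equal to 1 the model reduces to ordinary NMF (weights.T @ components).
unstretched = stretched_model(components, weights, np.ones((2, 2)), grid)
# A uniform 2% stretch moves every peak to a larger grid position.
stretched = stretched_model(components, weights, np.full((2, 2), 1.02), grid)
```

With every stretching factor set to 1 the reconstruction is the plain matrix product of weights and components, which is why conventional NMF is the special case without expansion.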
-
Towards a machine-readable literature: finding relevant papers based on an uploaded powder diffraction pattern
Authors:
Berrak Özer,
Martin A. Karlsen,
Zachary Thatcher,
Ling Lan,
Brian McMahon,
Peter R. Strickland,
Simon P. Westrip,
Koh S. Sang,
David G. Billing,
Dorthe B. Ravnsbæk,
Simon J. L. Billinge
Abstract:
We investigate a prototype application for machine-readable literature. The program is called "pyDataRecognition" and serves as an example of a data-driven literature search, where the literature search query is an experimental dataset provided by the user. The user uploads a powder pattern together with the radiation wavelength. The program compares the user data to a database of existing powder patterns associated with published papers and produces a list of those patterns rank-ordered by similarity score. The program returns the digital object identifier (DOI) and full reference of the top-ranked papers together with a stack plot of the user data alongside the top five database entries. The paper describes the approach and explores successes and challenges.
Submitted 1 April, 2022;
originally announced April 2022.
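The ranking step described in the abstract above might look roughly like the following sketch. It is not the pyDataRecognition code. The abstract does not say which similarity score is used, so the Pearson correlation here is an assumption, as are the conversion of the uploaded pattern to a common Q grid, the function names, and the database identifiers ("entry-A", "entry-B"). The wavelength value is only an example, in angstroms.

```python
import numpy as np

def two_theta_to_q(two_theta_deg, wavelength):
    """Convert scattering angle 2-theta (degrees) to Q = 4*pi*sin(theta)/wavelength."""
    theta = np.radians(two_theta_deg) / 2.0
    return 4.0 * np.pi * np.sin(theta) / wavelength

def rank_by_similarity(user_q, user_intensity, database, q_grid):
    """Rank database patterns against the user pattern, best match first.

    database: dict mapping an identifier to a (q, intensity) pair of arrays.
    The score is the Pearson correlation on a shared Q grid (an assumed metric).
    """
    user = np.interp(q_grid, user_q, user_intensity)
    scores = []
    for key, (q, intensity) in database.items():
        entry = np.interp(q_grid, q, intensity)
        scores.append((key, float(np.corrcoef(user, entry)[0, 1])))
    return sorted(scores, key=lambda item: item[1], reverse=True)

def pattern(q, centers, width=0.05):
    """Synthetic pattern: a sum of Gaussian peaks at the given Q positions."""
    return sum(np.exp(-0.5 * ((q - c) / width) ** 2) for c in centers)

# Hypothetical two-entry database stored on the common Q grid.
q_grid = np.linspace(1.0, 5.0, 401)
database = {
    "entry-A": (q_grid, pattern(q_grid, [2.0, 3.5])),
    "entry-B": (q_grid, pattern(q_grid, [2.6, 4.2])),
}

# Simulated upload: measured in 2-theta, with a different scale and offset.
two_theta = np.linspace(10.0, 80.0, 701)
user_q = two_theta_to_q(two_theta, wavelength=1.5406)
user_intensity = 0.9 * pattern(user_q, [2.0, 3.5]) + 0.05

ranking = rank_by_similarity(user_q, user_intensity, database, q_grid)
```

Converting to Q before comparing removes the dependence on wavelength, which is one plausible reason the user is asked to supply it. A real system would map each identifier to the DOI and reference of the associated paper.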
-
nmfMapping: a cloud-based web application for non-negative matrix factorization of powder diffraction and pair distribution function datasets
Authors:
Zachary Thatcher,
Chia-Hao Liu,
Long Yang,
Brennan C. McBride,
Gia Thinh Tran,
Allison Wustrow,
Martin A. Karlsen,
James R. Neilson,
Dorthe B. Ravnsbæk,
Simon J. L. Billinge
Abstract:
A cloud-hosted, web-based software application, nmfMapping, for carrying out a non-negative matrix factorization of a set of powder diffraction or atomic pair distribution function datasets is described. This app allows structure scientists to rapidly find trends in sets of related data, such as those from in situ and operando diffraction experiments. The application is easy to use and does not require any programming expertise. It is available at the pdfitc.org website.
Submitted 25 December, 2021;
originally announced December 2021.
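For readers unfamiliar with the factorization itself, the following is a minimal sketch of conventional NMF using the standard Lee-Seung multiplicative updates. It is not the nmfMapping code, whose internals are not described in the abstract. The function name, the parameters, and the synthetic two-phase dataset are assumptions made for illustration.

```python
import numpy as np

def nmf(X, n_components, n_iter=500, seed=0, eps=1e-12):
    """Factor a non-negative matrix X (signals x points) as W @ H with W, H >= 0.

    Uses multiplicative updates that minimise the Frobenius norm of X - W @ H.
    """
    rng = np.random.default_rng(seed)
    n_signals, n_points = X.shape
    W = rng.random((n_signals, n_components))  # per-signal weights
    H = rng.random((n_components, n_points))   # component signals
    for _ in range(n_iter):
        H *= (W.T @ X) / (W.T @ W @ H + eps)
        W *= (X @ H.T) / (W @ H @ H.T + eps)
    return W, H

# Synthetic series: one phase converting into another over 20 measurements.
grid = np.linspace(0.0, 10.0, 200)
phase_1 = np.exp(-0.5 * ((grid - 3.0) / 0.3) ** 2)
phase_2 = np.exp(-0.5 * ((grid - 7.0) / 0.3) ** 2)
fraction = np.linspace(0.0, 1.0, 20)
X = np.outer(fraction, phase_1) + np.outer(1.0 - fraction, phase_2)

W, H = nmf(X, n_components=2)
residual = np.linalg.norm(X - W @ H)
```

Plotting the columns of W against measurement index is one way to read off trends across such a series, with the rows of H showing the corresponding component signals.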