Massively parallel quantum chemistry: PFAS on over 1 million cloud vCPUs
Authors:
Alan E. Rask,
Lee Huntington,
SungYeon Kim,
David Walker,
Andrew Wildman,
Rodrigo Wang,
Nicole Hazel,
Alan Judi,
James T. Pegg,
Punit K. Jha,
Zara Mayimfor,
Carl Dukatz,
Hassan Naseri,
Ilan Gleiser,
Maxime R. Hugues,
Paul M. Zimmerman,
Arman Zaribafiyan,
Rudi Plesch,
Takeshi Yamazaki
Abstract:
Accurate solutions to the electronic Schrödinger equation can provide valuable insight for electron interactions within molecular systems, accelerating the molecular design and discovery processes in many different applications. However, the availability of such accurate solutions are limited to small molecular systems due to both the extremely high computational complexity and the challenge of op…
▽ More
Accurate solutions to the electronic Schrödinger equation can provide valuable insight for electron interactions within molecular systems, accelerating the molecular design and discovery processes in many different applications. However, the availability of such accurate solutions are limited to small molecular systems due to both the extremely high computational complexity and the challenge of operating and executing these workloads on high-performance compute clusters. This work presents a massively scalable cloud-based quantum chemistry platform by implementing a highly parallelizable quantum chemistry method that provides a polynomial-scaling approximation to full configuration interaction (FCI). Our platform orchestrates more than one million virtual CPUs on the cloud to analyze the bond-breaking behaviour of carbon-fluoride bonds of per- and polyfluoroalkyl substances (PFAS) with near-exact accuracy within the chosen basis set. This is the first quantum chemistry calculation utilizing more than one million virtual CPUs on the cloud and is the most accurate electronic structure computation of PFAS bond breaking to date.
△ Less
Submitted 20 July, 2023;
originally announced July 2023.
HypoML: Visual Analysis for Hypothesis-based Evaluation of Machine Learning Models
Authors:
Qianwen Wang,
William Alexander,
Jack Pegg,
Huamin Qu,
Min Chen
Abstract:
In this paper, we present a visual analytics tool for enabling hypothesis-based evaluation of machine learning (ML) models. We describe a novel ML-testing framework that combines the traditional statistical hypothesis testing (commonly used in empirical research) with logical reasoning about the conclusions of multiple hypotheses. The framework defines a controlled configuration for testing a numb…
▽ More
In this paper, we present a visual analytics tool for enabling hypothesis-based evaluation of machine learning (ML) models. We describe a novel ML-testing framework that combines the traditional statistical hypothesis testing (commonly used in empirical research) with logical reasoning about the conclusions of multiple hypotheses. The framework defines a controlled configuration for testing a number of hypotheses as to whether and how some extra information about a "concept" or "feature" may benefit or hinder a ML model. Because reasoning multiple hypotheses is not always straightforward, we provide HypoML as a visual analysis tool, with which, the multi-thread testing data is transformed to a visual representation for rapid observation of the conclusions and the logical flow between the testing data and hypotheses.We have applied HypoML to a number of hypothesized concepts, demonstrating the intuitive and explainable nature of the visual analysis.
△ Less
Submitted 12 February, 2020;
originally announced February 2020.