Search | arXiv e-print repository

Battle Sheep is PSPACE-complete

Abstract: Battle Sheep is a board game published by Blue Orange Games. With two players, it is a combinatorial game that uses normal play rules. We show that it is PSPACE-complete, even when each stack has only up to 3 tokens. Battle Sheep is a board game published by Blue Orange Games. With two players, it is a combinatorial game that uses normal play rules. We show that it is PSPACE-complete, even when each stack has only up to 3 tokens. △ Less

Submitted 9 May, 2025; originally announced May 2025.

Comments: 11 pages, 11 figures

MSC Class: 91A46 ACM Class: F.1.3; F.2.2

arXiv:2504.03845 [pdf, other]

Approximate normalizations for approximate density functionals

Authors: Adam Clay, Kiril Datchev, Wenlan Miao, Adam Wasserman, Kimberly J. Daas, Kieron Burke

Abstract: It seems self-evident that a density functional calculation should be normalized to the number of electrons in the system. We present multiple examples where the accuracy of the approximate energy is improved (sometimes greatly) by violating this basic principle. In one dimension, we explicitly derive the appropriate correction to the normalization. Beyond one dimension, Weyl asymptotics for energ… ▽ More It seems self-evident that a density functional calculation should be normalized to the number of electrons in the system. We present multiple examples where the accuracy of the approximate energy is improved (sometimes greatly) by violating this basic principle. In one dimension, we explicitly derive the appropriate correction to the normalization. Beyond one dimension, Weyl asymptotics for energy levels yield these corrections for any cavity. We include examples with Coulomb potentials and the exchange energy of atoms to illustrate relevance to realistic calculations. △ Less

Submitted 4 April, 2025; originally announced April 2025.

Comments: 7 Pages + 10 SI, 5 Figures + 2 SI, txt files with data in Source

arXiv:2503.01709 [pdf, other]

Can machines learn density functionals? Past, present, and future of ML in DFT

Authors: Ryosuke Akashi, Mihira Sogal, Kieron Burke

Abstract: Density functional theory has become the world's favorite electronic structure method, and is routinely applied to both materials and molecules. Here, we review recent attempts to use modern machine-learning to improve density functional approximations. Many different researchers have tried many different approaches, but some common themes and lessons have emerged. We discuss these trends and wher… ▽ More Density functional theory has become the world's favorite electronic structure method, and is routinely applied to both materials and molecules. Here, we review recent attempts to use modern machine-learning to improve density functional approximations. Many different researchers have tried many different approaches, but some common themes and lessons have emerged. We discuss these trends and where they might bring us in the future. △ Less

Submitted 3 March, 2025; originally announced March 2025.

Comments: 46 pages, 5 figures, 2 tables. Submitted to "Machine Learning in Condensed Matter Physics - Significance, Challenges, and Future Directions" (Springer Series in Solid-State Sciences)

arXiv:2501.06574 [pdf, ps, other]

Col is PSPACE-complete on Triangular Grids

Authors: Kyle Burke, Craig Tennenhouse

Abstract: We demonstrate that Col is PSPACE-complete on triangular grid graphs via a reduction from Bounded Two-Player Constraint Logic. This is the most structured graph family that Col is known to be computationally hard for. We demonstrate that Col is PSPACE-complete on triangular grid graphs via a reduction from Bounded Two-Player Constraint Logic. This is the most structured graph family that Col is known to be computationally hard for. △ Less

Submitted 25 January, 2025; v1 submitted 11 January, 2025; originally announced January 2025.

Comments: 10 pages, 16 figures

MSC Class: 91A46 ACM Class: F.1.3; G.2.1; F.2.2

arXiv:2412.10388

AI-assisted summary of suicide risk Formulation

Authors: Rajib Rana, Niall Higgins, Kazi N. Haque, John Reilly, Kylie Burke, Kathryn Turner, Anthony R. Pisani, Terry Stedman

Abstract: Background: Formulation, associated with suicide risk assessment, is an individualised process that seeks to understand the idiosyncratic nature and development of an individual's problems. Auditing clinical documentation on an electronic health record (EHR) is challenging as it requires resource-intensive manual efforts to identify keywords in relevant sections of specific forms. Furthermore, cli… ▽ More Background: Formulation, associated with suicide risk assessment, is an individualised process that seeks to understand the idiosyncratic nature and development of an individual's problems. Auditing clinical documentation on an electronic health record (EHR) is challenging as it requires resource-intensive manual efforts to identify keywords in relevant sections of specific forms. Furthermore, clinicians and healthcare professionals often do not use keywords; their clinical language can vary greatly and may contain various jargon and acronyms. Also, the relevant information may be recorded elsewhere. This study describes how we developed advanced Natural Language Processing (NLP) algorithms, a branch of Artificial Intelligence (AI), to analyse EHR data automatically. Method: Advanced Optical Character Recognition techniques were used to process unstructured data sets, such as portable document format (pdf) files. Free text data was cleaned and pre-processed using Normalisation of Free Text techniques. We developed algorithms and tools to unify the free text. Finally, the formulation was checked for the presence of each concept based on similarity using NLP-powered semantic matching techniques. Results: We extracted information indicative of formulation and assessed it to cover the relevant concepts. This was achieved using a Weighted Score to obtain a Confidence Level. Conclusion: The rigour to which formulation is completed is crucial to effectively using EHRs, ensuring correct and timely identification, engagement and interventions that may potentially avoid many suicide attempts and suicides. △ Less

Submitted 19 December, 2024; v1 submitted 29 November, 2024; originally announced December 2024.

Comments: Have some incorrect information

arXiv:2412.00057 [pdf, other]

Feasibility of Mental Health Triage Call Priority Prediction Using Machine Learning

Authors: Rajib Rana, Niall Higgins, Kazi Nazmul Haque, John Reilly, Kylie Burke, Kathryn Turner, Terry Stedman

Abstract: Ensuring accurate call prioritisation is essential for optimising the efficiency and responsiveness of mental health helplines. Currently, call operators rely entirely on the caller's statements to determine the priority of the calls. It has been shown that entirely subjective assessment can lead to errors. Furthermore, it is a missed opportunity not to utilise the voice properties readily availab… ▽ More Ensuring accurate call prioritisation is essential for optimising the efficiency and responsiveness of mental health helplines. Currently, call operators rely entirely on the caller's statements to determine the priority of the calls. It has been shown that entirely subjective assessment can lead to errors. Furthermore, it is a missed opportunity not to utilise the voice properties readily available during the call to aid in the evaluation. Incorrect prioritisation can result in delayed assistance for high-risk individuals, resource misallocation, increased mental health deterioration, loss of trust, and potential legal consequences. It is vital to address these risks to guarantee the reliability and effectiveness of mental health services. This study delves into the potential of using machine learning, a branch of Artificial Intelligence, to estimate call priority from the callers' voices for users of mental health phone helplines. After analysing 459 call records from a mental health helpline, we achieved a balanced accuracy of 92\%, showing promise in aiding the call operators' efficiency in call handling processes and improving customer satisfaction. △ Less

Submitted 24 November, 2024; originally announced December 2024.

arXiv:2404.10390 [pdf, ps, other]

Complexity and algorithms for Arc-Kayles and Non-Disconnecting Arc-Kayles

Authors: Kyle Burke, Antoine Dailly, Nacim Oijid

Abstract: Arc-Kayles is a game where two players alternate removing two adjacent vertices until no move is left. Introduced in 1978, its computational complexity is still open. More recently, subtraction games, where the players cannot disconnect the graph while removing vertices, were introduced. In particular, Arc-Kayles admits a non-disconnecting variant that is a subtraction game. We study the computati… ▽ More Arc-Kayles is a game where two players alternate removing two adjacent vertices until no move is left. Introduced in 1978, its computational complexity is still open. More recently, subtraction games, where the players cannot disconnect the graph while removing vertices, were introduced. In particular, Arc-Kayles admits a non-disconnecting variant that is a subtraction game. We study the computational complexity of subtraction games on graphs, proving that they are PSPACE-complete even on very structured graph classes (split, bipartite of any even girth). We prove that Non-Disconnecting Arc-Kayles can be solved in polynomial-time on unicyclic graphs, clique trees, and subclasses of threshold graphs. We also show that a sufficient condition for a second player-win on Arc-Kayles is equivalent to the graph isomorphism problem. △ Less

Submitted 16 April, 2024; originally announced April 2024.

arXiv:2403.04955 [pdf, other]

A Tractability Gap Beyond Nim-Sums: It's Hard to Tell Whether a Bunch of Superstars Are Losers

Authors: Kyle Burke, Matthew Ferland, Svenja Huntemann, Shang-Hua Teng

Abstract: In this paper, we address a natural question at the intersection of combinatorial game theory and computational complexity: "Can a sum of simple tepid games in canonical form be intractable?" To resolve this fundamental question, we consider superstars, positions first introduced in Winning Ways where all options are nimbers. Extending Morris' classic result with hot games to tepid games, we prove… ▽ More In this paper, we address a natural question at the intersection of combinatorial game theory and computational complexity: "Can a sum of simple tepid games in canonical form be intractable?" To resolve this fundamental question, we consider superstars, positions first introduced in Winning Ways where all options are nimbers. Extending Morris' classic result with hot games to tepid games, we prove that disjunctive sums of superstars are intractable to solve. This is striking as sums of nimbers can be computed in linear time. Our analyses also lead to a family of elegant board games with intriguing complexity, for which we present web-playable versions of the rulesets described within. △ Less

Submitted 7 March, 2024; originally announced March 2024.

Comments: 17 pages

MSC Class: 91A46 ACM Class: F.1.3; G.2; F.2.2

arXiv:2403.03364 [pdf, other]

Exchange-correlation energy from Green's functions

Authors: Steven Crisostomo, E. K. U. Gross, Kieron Burke

Abstract: DFT calculations yield useful ground-state energies and densities, while Green's function techniques (such as $GW$) are mostly used to produce spectral functions. From the Galitskii-Migdal formula, we extract the exchange-correlation of DFT directly from a Green's function. This spectral representation provides an alternative to the fluctuation-dissipation theorem of DFT, identifying distinct sing… ▽ More DFT calculations yield useful ground-state energies and densities, while Green's function techniques (such as $GW$) are mostly used to produce spectral functions. From the Galitskii-Migdal formula, we extract the exchange-correlation of DFT directly from a Green's function. This spectral representation provides an alternative to the fluctuation-dissipation theorem of DFT, identifying distinct single-particle and many-particle contributions. Results are illustrated on the uniform electron gas and the two-site Hubbard model. △ Less

Submitted 11 March, 2024; v1 submitted 5 March, 2024; originally announced March 2024.

arXiv:2311.08139 [pdf, other]

Feedforward neural networks as statistical models: Improving interpretability through uncertainty quantification

Authors: Andrew McInerney, Kevin Burke

Abstract: Feedforward neural networks (FNNs) are typically viewed as pure prediction algorithms, and their strong predictive performance has led to their use in many machine-learning applications. However, their flexibility comes with an interpretability trade-off; thus, FNNs have been historically less popular among statisticians. Nevertheless, classical statistical theory, such as significance testing and… ▽ More Feedforward neural networks (FNNs) are typically viewed as pure prediction algorithms, and their strong predictive performance has led to their use in many machine-learning applications. However, their flexibility comes with an interpretability trade-off; thus, FNNs have been historically less popular among statisticians. Nevertheless, classical statistical theory, such as significance testing and uncertainty quantification, is still relevant. Supplementing FNNs with methods of statistical inference, and covariate-effect visualisations, can shift the focus away from black-box prediction and make FNNs more akin to traditional statistical models. This can allow for more inferential analysis, and, hence, make FNNs more accessible within the statistical-modelling context. △ Less

Submitted 14 November, 2023; originally announced November 2023.

MSC Class: 62J02; 68T07; 62F03

arXiv:2310.03982 [pdf, other]

Investigations of the exchange energy of neutral atoms in the large-Z limit

Authors: Jeremy J. Redd, Antonio C. Cancio, Nathan Argaman, Kieron Burke

Abstract: The non-relativistic large-$Z$ expansion of the exchange energy of neutral atoms provides an important input to modern non-empirical density functional approximations. Recent works report results of fitting the terms beyond the dominant term, given by the local density approximation (LDA), leading to an anomalous ZlnZ term that can not be predicted from naive scaling arguments. Here, we provide mu… ▽ More The non-relativistic large-$Z$ expansion of the exchange energy of neutral atoms provides an important input to modern non-empirical density functional approximations. Recent works report results of fitting the terms beyond the dominant term, given by the local density approximation (LDA), leading to an anomalous ZlnZ term that can not be predicted from naive scaling arguments. Here, we provide much more detailed data analysis of the mostly smooth asymptotic trend describing the difference between exact and LDA exchange energy, the nature of oscillations across rows of the periodic table, and the behavior of the LDA contribution itself. Special emphasis is given to the successes and difficulties in reproducing the exchange energy and its asymptotics with existing density functional approximations. △ Less

Submitted 5 October, 2023; originally announced October 2023.

Comments: 15 pages 11 figures + 4 pages supplemental information

arXiv:2308.03319 [pdf, other]

Generalized Gradient Approximation Made Thermal

Authors: John Kozlowski, Dennis Perchak, Kieron Burke

Abstract: Using the methodology of conditional-probability density functional theory, and several mild assumptions, we calculate the temperature-dependence of the Perdew-Burke-Ernzerhof (PBE) generalized gradient approximation (GGA). This numerically-defined thermal GGA reduces to the local approximation in the uniform limit and PBE at zero temperature, and can be fit reasonably accurately (within 8%) assum… ▽ More Using the methodology of conditional-probability density functional theory, and several mild assumptions, we calculate the temperature-dependence of the Perdew-Burke-Ernzerhof (PBE) generalized gradient approximation (GGA). This numerically-defined thermal GGA reduces to the local approximation in the uniform limit and PBE at zero temperature, and can be fit reasonably accurately (within 8%) assuming the temperature-dependent enhancement is independent of the gradient. This locally thermal PBE satisfies both the coordinate-scaled correlation inequality and the concavity condition, which we prove for finite temperatures. The temperature dependence differs markedly from existing thermal GGA's. △ Less

Submitted 15 August, 2023; v1 submitted 7 August, 2023; originally announced August 2023.

Comments: 6 pages, 5 figures

arXiv:2307.10734 [pdf, other]

DC-DFT for Open Shells: How to Deal with Spin Contamination

Authors: Hayoung Yu, Suhwan Song, Seungsoo Nam, Kieron Burke, Eunji Sim

Abstract: Density functional theory (DFT) is widely used to predict chemical properties, but its accuracy is limited by functional approximations and their approximate self-consistent densities. Density-corrected DFT (DC-DFT) is the study of the errors due to densities and Hartree-Fock DFT (HF-DFT) uses HF densities to improve energetics. With increasing use of HF-DFT, the question of how to address strong… ▽ More Density functional theory (DFT) is widely used to predict chemical properties, but its accuracy is limited by functional approximations and their approximate self-consistent densities. Density-corrected DFT (DC-DFT) is the study of the errors due to densities and Hartree-Fock DFT (HF-DFT) uses HF densities to improve energetics. With increasing use of HF-DFT, the question of how to address strong spin contamination in the HF calculation becomes increasingly important. We compare two different open-shell HF densities across 13 different DFT functionals and two DC-DFT methods. For significant spin contamination, ROHF densities outperform UHF densities by as much as a factor of 3, depending on the energy functional, and ROHF-DFT improves over self-consistent DFT for most of the tested functionals. We refine the DC(HF)-DFT algorithm, recommending ROHF-DFT in cases of severe spin contamination. △ Less

Submitted 20 July, 2023; originally announced July 2023.

Comments: 10 pages, 4 figures, 2 tables

arXiv:2307.00187 [pdf, other]

Exact Conditions for Ensemble Density Functional Theory

Authors: Thais R. Scott, John Kozlowski, Steven Crisostomo, Aurora Pribram-Jones, Kieron Burke

Abstract: Ensemble density functional theory (EDFT) is a promising alternative to time-dependent density functional theory for computing electronic excitation energies. Using coordinate scaling, we prove several fundamental exact conditions in EDFT and illustrate them on the exact singlet bi-ensemble of the Hubbard dimer. Several approximations violate these conditions, and some ground-state conditions from… ▽ More Ensemble density functional theory (EDFT) is a promising alternative to time-dependent density functional theory for computing electronic excitation energies. Using coordinate scaling, we prove several fundamental exact conditions in EDFT and illustrate them on the exact singlet bi-ensemble of the Hubbard dimer. Several approximations violate these conditions, and some ground-state conditions from quantum chemistry do not generalize to EDFT. The strong-correlation limit is derived for the dimer, revealing weight-dependent derivative discontinuities in EDFT. △ Less

Submitted 30 June, 2023; originally announced July 2023.

Comments: 6 pages 5 figures

arXiv:2304.11115 [pdf, other]

Orbital-free functional with sub-milliHartree errors for slabs

Authors: Pavel Okun, Antonio C. Cancio, Kieron Burke

Abstract: Using principles of asymptotic analysis, we derive the exact leading corrections to the Thomas-Fermi kinetic energy approximation for Kohn-Sham electrons for slabs. This asymptotic expansion approximation includes crucial quantum oscillations missed by standard semilocal density functionals. Because these account for the derivative discontinuity, chemical accuracy is achieved at fourth-order. The… ▽ More Using principles of asymptotic analysis, we derive the exact leading corrections to the Thomas-Fermi kinetic energy approximation for Kohn-Sham electrons for slabs. This asymptotic expansion approximation includes crucial quantum oscillations missed by standard semilocal density functionals. Because these account for the derivative discontinuity, chemical accuracy is achieved at fourth-order. The implications for both orbital-free electronic structure and exchange-correlation approximations are discussed. △ Less

Submitted 20 February, 2024; v1 submitted 22 March, 2023; originally announced April 2023.

arXiv:2303.01766 [pdf, other]

The difference between molecules and materials: Reassessing the role of exact conditions in density functional theory

Authors: Ryan Pederson, Kieron Burke

Abstract: Exact conditions have long been used to guide the construction of density functional approximations. But hundreds of empirical-based approximations tailored for chemistry are in use, many of which neglect these conditions in their design. We analyze well-known conditions and revive several obscure ones. Two crucial distinctions are drawn: that between necessary and sufficient conditions, and betwe… ▽ More Exact conditions have long been used to guide the construction of density functional approximations. But hundreds of empirical-based approximations tailored for chemistry are in use, many of which neglect these conditions in their design. We analyze well-known conditions and revive several obscure ones. Two crucial distinctions are drawn: that between necessary and sufficient conditions, and between all electronic densities and the subset of realistic Coulombic ground states. Simple search algorithms find that many empirical approximations satisfy many exact conditions for realistic densities and non-empirical approximations satisfy even more conditions than those enforced in their construction. The role of exact conditions in developing approximations is revisited. △ Less

Submitted 12 August, 2023; v1 submitted 3 March, 2023; originally announced March 2023.

arXiv:2301.06127 [pdf, ps, other]

Forced Capture Hnefatafl

Authors: Kyle Burke, Craig Tennenhouse

Abstract: We define a new, partizan, loopy combinatorial game, Forced-Capture Hnefatafl, similar to Hnefatafl, except that players are forced to make capturing moves when available. We show that this game is PSPACE-hard using a reduction from Constraint Logic, making progress towards classifying proper Hnefatafl. We define a new, partizan, loopy combinatorial game, Forced-Capture Hnefatafl, similar to Hnefatafl, except that players are forced to make capturing moves when available. We show that this game is PSPACE-hard using a reduction from Constraint Logic, making progress towards classifying proper Hnefatafl. △ Less

Submitted 20 June, 2023; v1 submitted 15 January, 2023; originally announced January 2023.

Comments: 12 pages, 9 figures, 4 open problems

MSC Class: 91A46 ACM Class: F.1.3; G.2.1

arXiv:2212.10631 [pdf, other]

Vexing Vexillological Logic

Authors: Kyle Burke, Craig Tennenhouse

Abstract: We define a new impartial combinatorial game, Flag Coloring, based on flood filling. We then generalize to a graph game, and find values for many positions on two colors. We demonstrate that the generalized game is PSPACE-complete for two colors or more via a reduction from Avoid True, determine the outcome classes of games based on real-world flags, and discuss remaining open problems. We define a new impartial combinatorial game, Flag Coloring, based on flood filling. We then generalize to a graph game, and find values for many positions on two colors. We demonstrate that the generalized game is PSPACE-complete for two colors or more via a reduction from Avoid True, determine the outcome classes of games based on real-world flags, and discuss remaining open problems. △ Less

Submitted 20 December, 2022; originally announced December 2022.

Comments: 22 pages, 8 figures, 10 tables

MSC Class: 91A46 ACM Class: F.1.3; G.2.1; G.2.2

arXiv:2212.07317 [pdf, other]

Robust Distributional Regression with Automatic Variable Selection

Authors: Meadhbh O'Neill, Kevin Burke

Abstract: Datasets with extreme observations and/or heavy-tailed error distributions are commonly encountered and should be analyzed with careful consideration of these features from a statistical perspective. Small deviations from an assumed model, such as the presence of outliers, can cause classical regression procedures to break down, potentially leading to unreliable inferences. Other distributional de… ▽ More Datasets with extreme observations and/or heavy-tailed error distributions are commonly encountered and should be analyzed with careful consideration of these features from a statistical perspective. Small deviations from an assumed model, such as the presence of outliers, can cause classical regression procedures to break down, potentially leading to unreliable inferences. Other distributional deviations, such as heteroscedasticity, can be handled by going beyond the mean and modelling the scale parameter in terms of covariates. We propose a method that accounts for heavy tails and heteroscedasticity through the use of a generalized normal distribution (GND). The GND contains a kurtosis-characterizing shape parameter that moves the model smoothly between the normal distribution and the heavier-tailed Laplace distribution - thus covering both classical and robust regression. A key component of statistical inference is determining the set of covariates that influence the response variable. While correctly accounting for kurtosis and heteroscedasticity is crucial to this endeavour, a procedure for variable selection is still required. For this purpose, we use a novel penalized estimation procedure that avoids the typical computationally demanding grid search for tuning parameters. This is particularly valuable in the distributional regression setting where the location and scale parameters depend on covariates, since the standard approach would have multiple tuning parameters (one for each distributional parameter). We achieve this by using a "smooth information criterion" that can be optimized directly, where the tuning parameters are fixed at log(n) in the BIC case. △ Less

Submitted 11 January, 2023; v1 submitted 14 December, 2022; originally announced December 2022.

MSC Class: 62J07

arXiv:2210.07167 [pdf, other]

Analysis of mean-field approximation for Deffuant opinion dynamics on networks

Authors: Alina Dubovskaya, Susan C. Fennell, Kevin Burke, James P. Gleeson, Doireann O'Kiely

Abstract: Mean-field equations have been developed recently to approximate the dynamics of the Deffuant model of opinion formation. These equations can describe both fully-mixed populations and the case where individuals interact only along edges of a network. In each case, interactions only occur between individuals whose opinions differ by less than a given parameter, called the confidence bound. The size… ▽ More Mean-field equations have been developed recently to approximate the dynamics of the Deffuant model of opinion formation. These equations can describe both fully-mixed populations and the case where individuals interact only along edges of a network. In each case, interactions only occur between individuals whose opinions differ by less than a given parameter, called the confidence bound. The size of the confidence bound parameter is known to strongly affect both the dynamics and the number and location of opinion clusters. In this work we carry out a mathematical analysis of the mean-field equations to investigate the role of the confidence bound and boundaries on these important observables of the model. We consider the limit in which the confidence bound interval is small, and identify the key mechanisms driving opinion evolution. We show that linear stability analysis can predict the number and location of opinion clusters. Comparison with numerical simulations of the model illustrates that the early-time dynamics and the final cluster locations can be accurately approximated for networks composed of two degree classes, as well as for the case of a fully-mixed population. △ Less

Submitted 13 October, 2022; originally announced October 2022.

MSC Class: 91D30; 35R09; 35Q91

arXiv:2207.05794 [pdf, other]

doi 10.1007/s11005-023-01665-z

Seven Useful Questions in Density Functional Theory

Authors: Steven Crisostomo, Ryan Pederson, John Kozlowski, Bhupalee Kalita, Antonio C. Cancio, Kiril Datchev, Adam Wasserman, Suhwan Song, Kieron Burke

Abstract: We explore a variety of unsolved problems in density functional theory, where mathematicians might prove useful. We give the background and context of the different problems, and why progress toward resolving them would help those doing computations using density functional theory. Subjects covered include the magnitude of the kinetic energy in Hartree-Fock calculations, the shape of adiabatic con… ▽ More We explore a variety of unsolved problems in density functional theory, where mathematicians might prove useful. We give the background and context of the different problems, and why progress toward resolving them would help those doing computations using density functional theory. Subjects covered include the magnitude of the kinetic energy in Hartree-Fock calculations, the shape of adiabatic connection curves, using the constrained search with input densities, densities of states, the semiclassical expansion of energies, the tightness of Lieb-Oxford bounds, and how we decide the accuracy of an approximate density. △ Less

Submitted 29 March, 2023; v1 submitted 12 July, 2022; originally announced July 2022.

Comments: 21 pages, 6 figures

arXiv:2207.04248 [pdf, other]

A Statistical-Modelling Approach to Feedforward Neural Network Model Selection

Authors: Andrew McInerney, Kevin Burke

Abstract: Feedforward neural networks (FNNs) can be viewed as non-linear regression models, where covariates enter the model through a combination of weighted summations and non-linear functions. Although these models have some similarities to the approaches used within statistical modelling, the majority of neural network research has been conducted outside of the field of statistics. This has resulted in… ▽ More Feedforward neural networks (FNNs) can be viewed as non-linear regression models, where covariates enter the model through a combination of weighted summations and non-linear functions. Although these models have some similarities to the approaches used within statistical modelling, the majority of neural network research has been conducted outside of the field of statistics. This has resulted in a lack of statistically-based methodology, and, in particular, there has been little emphasis on model parsimony. Determining the input layer structure is analogous to variable selection, while the structure for the hidden layer relates to model complexity. In practice, neural network model selection is often carried out by comparing models using out-of-sample performance. However, in contrast, the construction of an associated likelihood function opens the door to information-criteria-based variable and architecture selection. A novel model selection method, which performs both input- and hidden-node selection, is proposed using the Bayesian information criterion (BIC) for FNNs. The choice of BIC over out-of-sample performance as the model selection objective function leads to an increased probability of recovering the true model, while parsimoniously achieving favourable out-of-sample performance. Simulation studies are used to evaluate and justify the proposed method, and applications on real data are investigated. △ Less

Submitted 1 May, 2024; v1 submitted 9 July, 2022; originally announced July 2022.

MSC Class: 62J02; 68T07

arXiv:2207.04169 [pdf, other]

Extending density functional theory with near chemical accuracy beyond pure water

Authors: Suhwan Song, Stefan Vuckovic, Youngsam Kim, Hayoung Yu, Eunji Sim, Kieron Burke

Abstract: Density functional simulations of condensed phase water are typically inaccurate, due to the inaccuracies of approximate functionals. A recent breakthrough showed that the SCAN approximation can yield chemical accuracy for pure water in all its phases, but only when its density is corrected. This is a crucial step toward first-principles biosimulations. However, weak dispersion forces are ubiquito… ▽ More Density functional simulations of condensed phase water are typically inaccurate, due to the inaccuracies of approximate functionals. A recent breakthrough showed that the SCAN approximation can yield chemical accuracy for pure water in all its phases, but only when its density is corrected. This is a crucial step toward first-principles biosimulations. However, weak dispersion forces are ubiquitous and play a key role in noncovalent interactions among biomolecules, but are not included in the new approach. Moreover, naïve inclusion of dispersion in HF-SCAN ruins its high accuracy for pure water. Systematic application of the principles of density-corrected DFT yields a functional (HF-r$^2$SCAN-DC4) which recovers and not only improves over HF-SCAN for pure water, but also captures vital noncovalent interactions in biomolecules, making it suitable for simulations of solutions. △ Less

Submitted 8 February, 2023; v1 submitted 8 July, 2022; originally announced July 2022.

arXiv:2206.11372 [pdf, other]

doi 10.1063/5.0105684

Can the Hartree-Fock kinetic energy exceed the exact kinetic energy?

Authors: Steven Crisostomo, Mel Levy, Kieron Burke

Abstract: The Hartree-Fock (HF) approximation has been an important tool for quantum-chemical calculations since its earliest appearance in the late 1920s, and remains the starting point of most single-reference methods in use today. Intuition suggests that the HF kinetic energy should not exceed the exact kinetic energy, but no proof of this conjecture exists, despite a near century of development. Beginni… ▽ More The Hartree-Fock (HF) approximation has been an important tool for quantum-chemical calculations since its earliest appearance in the late 1920s, and remains the starting point of most single-reference methods in use today. Intuition suggests that the HF kinetic energy should not exceed the exact kinetic energy, but no proof of this conjecture exists, despite a near century of development. Beginning from a generalized virial theorem derived from scaling considerations, we derive a general expression for the kinetic energy difference that applies to all systems. For any atom or ion this trivially reduces to the well-known result that the total energy is the negative of the kinetic energy and since correlation energies are never positive, proves the conjecture in this case. Similar considerations apply to molecules at their equilibrium bond lengths. We use highly precise calculations on Hooke's atom (two electrons in a parabolic well) to test the conjecture in a non-trivial case, and to parameterize the difference between density-functional and HF quantities, but find no violations of the conjecture. △ Less

Submitted 22 February, 2023; v1 submitted 22 June, 2022; originally announced June 2022.

Comments: 7 pages, 6 figures; final edits

arXiv:2206.06199 [pdf, other]

How do classroom-turnover times depend on lecture-hall size?

Authors: Joseph Benson, Mariya Bessonov, Korana Burke, Simone Cassani, Maria-Veronica Ciocanel, Daniel B. Cooney, Alexandria Volkening

Abstract: Academic spaces in colleges and universities span classrooms for 10 students to lecture halls that hold over 600 people. During the break between consecutive classes, students from the first class must leave and the new class must find their desks, regardless of whether the room holds 10 or 600 people. Here we address the question of how the size of large lecture halls affects classroom-turnover t… ▽ More Academic spaces in colleges and universities span classrooms for 10 students to lecture halls that hold over 600 people. During the break between consecutive classes, students from the first class must leave and the new class must find their desks, regardless of whether the room holds 10 or 600 people. Here we address the question of how the size of large lecture halls affects classroom-turnover times, focusing on non-emergency settings. By adapting the established social-force model, we treat students as individuals who interact and move through classrooms to reach their destinations. We find that social interactions and the separation time between consecutive classes strongly influence how long it takes entering students to reach their desks, and that these effects are more pronounced in larger lecture halls. While the median time that individual students must travel increases with decreased separation time, we find that shorter separation times lead to shorter classroom-turnover times overall. This suggests that the effects of scheduling gaps and lecture-hall size on classroom dynamics depends on the perspective - individual student or whole class - that one chooses to take. △ Less

Submitted 23 June, 2022; v1 submitted 13 June, 2022; originally announced June 2022.

Comments: 29 pages, 12 figures, 4 tables, version 2: corrected error in author affiliations

MSC Class: 34; 91; 92

arXiv:2205.01591 [pdf, other]

doi 10.1038/s42254-022-00470-2

Machine learning and density functional theory

Authors: Ryan Pederson, Bhupalee Kalita, Kieron Burke

Abstract: Over the past decade machine learning has made significant advances in approximating density functionals, but whether this signals the end of human-designed functionals remains to be seen. Ryan Pederson, Bhupalee Kalita and Kieron Burke discuss the rise of machine learning for functional design. Over the past decade machine learning has made significant advances in approximating density functionals, but whether this signals the end of human-designed functionals remains to be seen. Ryan Pederson, Bhupalee Kalita and Kieron Burke discuss the rise of machine learning for functional design. △ Less

Submitted 22 May, 2022; v1 submitted 3 May, 2022; originally announced May 2022.

arXiv:2204.04524 [pdf, other]

doi 10.1088/1751-8121/ac8b45

Asymptotics of eigenvalue sums when some turning points are complex

Authors: Pavel Okun, Kieron Burke

Abstract: Recent work has shown a deep connection between semilocal approximations in density functional theory and the asymptotics of the sum of the WKB semiclassical expansion for the eigenvalues. However, all examples studied to date have potentials with only real classical turning points. But systems with complex turning points generate subdominant terms beyond those in the WKB series. The simplest case… ▽ More Recent work has shown a deep connection between semilocal approximations in density functional theory and the asymptotics of the sum of the WKB semiclassical expansion for the eigenvalues. However, all examples studied to date have potentials with only real classical turning points. But systems with complex turning points generate subdominant terms beyond those in the WKB series. The simplest case is a pure quartic oscillator. We show how to generalize the asymptotics of eigenvalue sums to include subdominant contributions to the sums, if they are known for the eigenvalues. These corrections to WKB greatly improve accuracy for eigenvalue sums, especially for many levels. We obtain further improvements to the sums through hyperasymptotics. For the lowest level, our summation method has error below $2 \times 10^{-4}$. For the sum of the lowest 10 levels, our error is less than $10^{-22}$. We report all results to many digits and include copious details of the asymptotic expansions and their derivation. △ Less

Submitted 12 July, 2022; v1 submitted 9 April, 2022; originally announced April 2022.

arXiv:2204.01030 [pdf, other]

doi 10.1103/PhysRevLett.129.153001

Leading correction to the local density approximation for exchange in large-$Z$ atoms

Authors: Nathan Argaman, Jeremy Redd, Antonio C. Cancio, Kieron Burke

Abstract: The large-$Z$ asymptotic expansion of atomic energies has been useful in determining exact conditions for corrections to the local density approximation in density functional theory. The correction for exchange is fit well with a leading $Z \ln Z$ term, and we find its coefficient numerically. The gradient expansion approximation also has such a term, but with a smaller coefficient. Analytic resul… ▽ More The large-$Z$ asymptotic expansion of atomic energies has been useful in determining exact conditions for corrections to the local density approximation in density functional theory. The correction for exchange is fit well with a leading $Z \ln Z$ term, and we find its coefficient numerically. The gradient expansion approximation also has such a term, but with a smaller coefficient. Analytic results in the limit of vanishing interaction with hydrogenic orbitals (a Bohr atom) lead to the conjecture that the coefficients are precisely 2.7 times larger than their gradient expansion counterparts, yielding an analytic expression for the exchange-energy correction which is accurate to $\sim 5\%$ for all $Z$. △ Less

Submitted 12 April, 2023; v1 submitted 3 April, 2022; originally announced April 2022.

Comments: Published version. 6 pages, 3 figures + 5 page supplement

Journal ref: Phys. Rev. Lett. 129, 153001 (2022)

arXiv:2203.09647 [pdf, other]

doi 10.1103/PhysRevB.105.245138

Conditional probability density functional theory

Authors: Ryan Pederson, Jielun Chen, Steven R. White, Kieron Burke

Abstract: We present conditional probability (CP) density functional theory (DFT) as a formally exact theory. In essence, CP-DFT determines the ground-state energy of a system by finding the CP density from a series of independent Kohn-Sham (KS) DFT calculations. By directly calculating CP densities, we bypass the need for an approximate XC energy functional. In this work we discuss and derive several key p… ▽ More We present conditional probability (CP) density functional theory (DFT) as a formally exact theory. In essence, CP-DFT determines the ground-state energy of a system by finding the CP density from a series of independent Kohn-Sham (KS) DFT calculations. By directly calculating CP densities, we bypass the need for an approximate XC energy functional. In this work we discuss and derive several key properties of the CP density and corresponding CP-KS potential. Illustrative examples are used throughout to help guide the reader through the various concepts and theory presented. We explore a suitable CP-DFT approximation and discuss exact conditions, limitations, and results for selected examples. △ Less

Submitted 24 June, 2022; v1 submitted 17 March, 2022; originally announced March 2022.

Journal ref: Phys. Rev. B 105, 245138 (2022)

arXiv:2203.06863 [pdf, other]

Improving results by improving densities: Density-corrected density functional theory

Authors: Eunji Sim, Suhwan Song, Stefan Vuckovic, Kieron Burke

Abstract: DFT calculations have become widespread in both chemistry and materials, because they usually provide useful accuracy at much lower computational cost than wavefunction-based methods. All practical DFT calculations require an approximation to the unknown exchange-correlation energy, which is then used self-consistently in the Kohn-Sham scheme to produce an approximate energy from an approximate de… ▽ More DFT calculations have become widespread in both chemistry and materials, because they usually provide useful accuracy at much lower computational cost than wavefunction-based methods. All practical DFT calculations require an approximation to the unknown exchange-correlation energy, which is then used self-consistently in the Kohn-Sham scheme to produce an approximate energy from an approximate density. Density-corrected DFT is simply the study of the relative contributions to the total energy error. In the vast majority of DFT calculations, the error due to the approximate density is negligible. But with certain classes of functionals applied to certain classes of problems, the density error is sufficiently large as to contribute to the energy noticeably, and its removal leads to much better results. These problems include reaction barriers, torsional barriers involving $π$-conjugation, halogen bonds, radicals and anions, most stretched bonds, etc. In all such cases, use of a more accurate density significantly improves performance, and often the simple expedient of using the Hartree-Fock density is enough. This article explains what DC-DFT is, where it is likely to improve results, and how DC-DFT can produce more accurate functionals. We also outline challenges and prospects for the field. △ Less

Submitted 14 March, 2022; originally announced March 2022.

arXiv:2202.05361 [pdf, ps, other]

Lieb's most useful contribution to density functional theory?

Authors: Kieron Burke

Abstract: The importance of the Lieb-Simon proof of the relative exactness of Thomas-Fermi theory in the large-Z limit to modern density functional theory (DFT) is explored. The principle, that there is a specific semiclassical limit in which functionals become local, implies that there exist well-defined leading functional corrections to local approximations that become relatively exact for the error in lo… ▽ More The importance of the Lieb-Simon proof of the relative exactness of Thomas-Fermi theory in the large-Z limit to modern density functional theory (DFT) is explored. The principle, that there is a specific semiclassical limit in which functionals become local, implies that there exist well-defined leading functional corrections to local approximations that become relatively exact for the error in local approximations in this limit. It is argued that this principle might be used to greatly improve the accuracy of the thousand or so DFT calculations that are now published each week. A key question is how to find the leading corrections to any local density approximation as this limit is approached. These corrections have been explicitly derived in ridiculously simple model systems to ridiculously high order, yielding ridiculously accurate energies. Much analytic work is needed to use this principle to improve realistic calculations of molecules and solids. △ Less

Submitted 10 February, 2022; originally announced February 2022.

arXiv:2201.06465 [pdf, other]

doi 10.1109/SWC50871.2021.00098

Process Visualization of Manufacturing Execution System (MES) Data

Authors: Meadhbh O'Neill, Jeff Morgan, Kevin Burke

Abstract: Process visualizations of data from manufacturing execution systems (MESs) provide the ability to generate valuable insights for improved decision-making. Industry 4.0 is awakening a digital transformation where advanced analytics and visualizations are critical. Exploiting MESs with data-driven strategies can have a major impact on business outcomes. The advantages of employing process visualizat… ▽ More Process visualizations of data from manufacturing execution systems (MESs) provide the ability to generate valuable insights for improved decision-making. Industry 4.0 is awakening a digital transformation where advanced analytics and visualizations are critical. Exploiting MESs with data-driven strategies can have a major impact on business outcomes. The advantages of employing process visualizations are demonstrated through an application to real-world data. Visualizations, such as dashboards, enable the user to examine the performance of a production line at a high level. Furthermore, the addition of interactivity facilitates the user to customize the data they want to observe. Evidence of process variability between shifts and days of the week can be investigated with the goal of optimizing production. △ Less

Submitted 17 January, 2022; originally announced January 2022.

MSC Class: 62P30

Journal ref: 2021 IEEE SmartWorld, Ubiquitous Intelligence & Computing, Advanced & Trusted Computing, Scalable Computing & Communications, Internet of People and Smart City Innovation (SmartWorld/SCALCOM/UIC/ATC/IOP/SCI) (2021) 659-664

arXiv:2201.06386 [pdf, other]

Visual Identification of Problematic Bias in Large Label Spaces

Authors: Alex Bäuerle, Aybuke Gul Turker, Ken Burke, Osman Aka, Timo Ropinski, Christina Greer, Mani Varadarajan

Abstract: While the need for well-trained, fair ML systems is increasing ever more, measuring fairness for modern models and datasets is becoming increasingly difficult as they grow at an unprecedented pace. One key challenge in scaling common fairness metrics to such models and datasets is the requirement of exhaustive ground truth labeling, which cannot always be done. Indeed, this often rules out the app… ▽ More While the need for well-trained, fair ML systems is increasing ever more, measuring fairness for modern models and datasets is becoming increasingly difficult as they grow at an unprecedented pace. One key challenge in scaling common fairness metrics to such models and datasets is the requirement of exhaustive ground truth labeling, which cannot always be done. Indeed, this often rules out the application of traditional analysis metrics and systems. At the same time, ML-fairness assessments cannot be made algorithmically, as fairness is a highly subjective matter. Thus, domain experts need to be able to extract and reason about bias throughout models and datasets to make informed decisions. While visual analysis tools are of great help when investigating potential bias in DL models, none of the existing approaches have been designed for the specific tasks and challenges that arise in large label spaces. Addressing the lack of visualization work in this area, we propose guidelines for designing visualizations for such large label spaces, considering both technical and ethical issues. Our proposed visualization approach can be integrated into classical model and data pipelines, and we provide an implementation of our techniques open-sourced as a TensorBoard plug-in. With our approach, different models and datasets for large label spaces can be systematically and visually analyzed and compared to make informed fairness assessments tackling problematic bias. △ Less

Submitted 17 January, 2022; originally announced January 2022.

arXiv:2201.04973 [pdf, ps, other]

Æquitas: Two-Player Counterfeit Coin Games

Authors: Kyle Burke, Tanya Khovanova, Joshua Lee, Richard J. Nowakowski, Amelia Rowland, Craig Tennenhouse

Abstract: We discuss games involving a counterfeit coin. Given one counterfeit coin among a number of otherwise identical coins, two players with full knowledge of the fake coin take turns weighing coins on a two-pan scale, under the condition that on every turn they reveal some information to an Observer with limited prior knowledge about the coins. We study several games depending on the types of the coun… ▽ More We discuss games involving a counterfeit coin. Given one counterfeit coin among a number of otherwise identical coins, two players with full knowledge of the fake coin take turns weighing coins on a two-pan scale, under the condition that on every turn they reveal some information to an Observer with limited prior knowledge about the coins. We study several games depending on the types of the counterfeit coin, the end-game condition, and the availability of extra genuine coins. We present the winning positions and the Grundy numbers for these games. △ Less

Submitted 12 January, 2022; originally announced January 2022.

Comments: 11 pages

MSC Class: 91A46

arXiv:2201.02599 [pdf, other]

doi 10.1103/PhysRevB.105.165143

Correlation energy of the uniform gas determined by ground state conditional probability density functional theory

Authors: Dennis Perchak, Ryan J. McCarty, Kieron Burke

Abstract: Conditional-probability density functional theory (CP-DFT) is a formally exact method for finding correlation energies from Kohn-Sham DFT without evaluating an explicit energy functional. We present details on how to generate accurate exchange-correlation energies for the ground-state uniform gas. We also use the exchange hole in a CP antiparallel spin calculation to extract the high-density limit… ▽ More Conditional-probability density functional theory (CP-DFT) is a formally exact method for finding correlation energies from Kohn-Sham DFT without evaluating an explicit energy functional. We present details on how to generate accurate exchange-correlation energies for the ground-state uniform gas. We also use the exchange hole in a CP antiparallel spin calculation to extract the high-density limit. We give a highly accurate analytic solution to the Thomas-Fermi model for this problem, showing its performance relative to Kohn-Sham and may be useful at high temperatures. We explore several approximations to the CP potential. Results are compared to accurate parameterizations for both exchange-correlation energies and holes. △ Less

Submitted 3 February, 2022; v1 submitted 7 January, 2022; originally announced January 2022.

Comments: Corrected typos, minor changes in text and notation for clarity and continuity with other work, results unchanged

arXiv:2112.05554 [pdf, other]

Using Machine Learning to Find New Density Functionals

Authors: Bhupalee Kalita, Kieron Burke

Abstract: Machine learning has now become an integral part of research and innovation. The field of machine learning density functional theory has continuously expanded over the years while making several noticeable advances. We briefly discuss the status of this field and point out some current and future challenges. We also talk about how state-of-the-art science and technology tools can help overcome the… ▽ More Machine learning has now become an integral part of research and innovation. The field of machine learning density functional theory has continuously expanded over the years while making several noticeable advances. We briefly discuss the status of this field and point out some current and future challenges. We also talk about how state-of-the-art science and technology tools can help overcome these challenges. This draft is a part of the "Roadmap on Machine Learning in Electronic Structure" to be published in Electronic Structure (EST). △ Less

Submitted 3 December, 2021; originally announced December 2021.

arXiv:2111.08573 [pdf, other]

Multi-Parameter Regression Survival Modelling with Random Effects

Authors: Fatima-Zahra Jaouimaa, Il Do Ha, Kevin Burke

Abstract: We consider a parametric modelling approach for survival data where covariates are allowed to enter the model through multiple distributional parameters, i.e., scale and shape. This is in contrast with the standard convention of having a single covariate-dependent parameter, typically the scale. Taking what is referred to as a multi-parameter regression (MPR) approach to modelling has been shown t… ▽ More We consider a parametric modelling approach for survival data where covariates are allowed to enter the model through multiple distributional parameters, i.e., scale and shape. This is in contrast with the standard convention of having a single covariate-dependent parameter, typically the scale. Taking what is referred to as a multi-parameter regression (MPR) approach to modelling has been shown to produce flexible and robust models with relatively low model complexity cost. However, it is very common to have clustered data arising from survival analysis studies, and this is something that is under developed in the MPR context. The purpose of this article is to extend MPR models to handle multivariate survival data by introducing random effects in both the scale and the shape regression components. We consider a variety of possible dependence structures for these random effects (independent, shared, and correlated), and estimation proceeds using a h-likelihood approach. The performance of our estimation procedure is investigated by a way of an extensive simulation study, and the merits of our modelling approach are illustrated through applications to two real data examples, a lung cancer dataset and a bladder cancer dataset. △ Less

Submitted 16 November, 2021; originally announced November 2021.

Comments: 31 pages, 5 figures and 9 tables

MSC Class: 62N01; 62N02; 62F99

arXiv:2110.14846 [pdf, other]

doi 10.1021/acs.jpclett.2c00371

How Well Does Kohn-Sham Regularizer Work for Weakly Correlated Systems?

Authors: Bhupalee Kalita, Ryan Pederson, Jielun Chen, Li Li, Kieron Burke

Abstract: Kohn-Sham regularizer (KSR) is a differentiable machine learning approach to finding the exchange-correlation functional in Kohn-Sham density functional theory (DFT) that works for strongly correlated systems. Here we test KSR for weak correlation. We propose spin-adapted KSR (sKSR) with trainable local, semilocal, and nonlocal approximations found by minimizing density and total energy loss. We a… ▽ More Kohn-Sham regularizer (KSR) is a differentiable machine learning approach to finding the exchange-correlation functional in Kohn-Sham density functional theory (DFT) that works for strongly correlated systems. Here we test KSR for weak correlation. We propose spin-adapted KSR (sKSR) with trainable local, semilocal, and nonlocal approximations found by minimizing density and total energy loss. We assess the atoms-to-molecules generalizability by training on one-dimensional (1D) H, He, Li, Be, Be$^{++}$ and testing on 1D hydrogen chains, LiH, BeH$_2$, and helium hydride complexes. The generalization error from our semilocal approximation is comparable to other differentiable approaches, but our nonlocal functional outperforms any existing machine learning functionals, predicting ground-state energies of test systems with a mean absolute error of 2.7 milli-Hartrees. △ Less

Submitted 21 March, 2022; v1 submitted 27 October, 2021; originally announced October 2021.

arXiv:2110.07849 [pdf, other]

Density-corrected DFT explained: Questions and answers

Authors: Suhwan Song, Stefan Vuckovic, Eunji Sim, Kieron Burke

Abstract: HF-DFT, the practice of evaluating approximate density functionals on Hartree-Fock densities, has long been used in testing density functional approximations. Density-corrected DFT (DC-DFT) is a general theoretical framework for identifying failures of density functional approximations by separating errors in a functional from errors in its self-consistent (SC) density. Most modern DFT calculation… ▽ More HF-DFT, the practice of evaluating approximate density functionals on Hartree-Fock densities, has long been used in testing density functional approximations. Density-corrected DFT (DC-DFT) is a general theoretical framework for identifying failures of density functional approximations by separating errors in a functional from errors in its self-consistent (SC) density. Most modern DFT calculations yield highly accurate densities, but important characteristic classes of calculation have large density-driven errors, including reaction barrier heights, electron affinities, radicals and anions in solution, dissociation of heterodimers, and even some torsional barriers. Here, the HF density (if not spin-contaminated) usually yields more accurate and consistent energies than those of the SC density. We use the term DC(HF)-DFT to indicate DC-DFT using HF densities only in such cases. A recent comprehensive study (J. Chem. Theory Comput. 2021, 17, 1368$-$1379) of HF-DFT led to many unfavorable conclusions. A re-analysis using DC-DFT shows that DC(HF)-DFT substantially improves DFT results precisely when SC densities are flawed. △ Less

Submitted 15 October, 2021; originally announced October 2021.

arXiv:2110.02643 [pdf, other]

Variable Selection Using a Smooth Information Criterion for Distributional Regression Models

Authors: Meadhbh O'Neill, Kevin Burke

Abstract: Modern variable selection procedures make use of penalization methods to execute simultaneous model selection and estimation. A popular method is the LASSO (least absolute shrinkage and selection operator), the use of which requires selecting the value of a tuning parameter. This parameter is typically tuned by minimizing the cross-validation error or Bayesian information criterion (BIC) but this… ▽ More Modern variable selection procedures make use of penalization methods to execute simultaneous model selection and estimation. A popular method is the LASSO (least absolute shrinkage and selection operator), the use of which requires selecting the value of a tuning parameter. This parameter is typically tuned by minimizing the cross-validation error or Bayesian information criterion (BIC) but this can be computationally intensive as it involves fitting an array of different models and selecting the best one. In contrast with this standard approach, we have developed a procedure based on the so-called "smooth IC" (SIC) in which the tuning parameter is automatically selected in one step. We also extend this model selection procedure to the distributional regression framework, which is more flexible than classical regression modelling. Distributional regression, also known as multiparameter regression (MPR), introduces flexibility by taking account of the effect of covariates through multiple distributional parameters simultaneously, e.g., mean and variance. These models are useful in the context of normal linear regression when the process under study exhibits heteroscedastic behaviour. Reformulating the distributional regression estimation problem in terms of penalized likelihood enables us to take advantage of the close relationship between model selection criteria and penalization. Utilizing the SIC is computationally advantageous, as it obviates the issue of having to choose multiple tuning parameters. △ Less

Submitted 11 January, 2023; v1 submitted 6 October, 2021; originally announced October 2021.

MSC Class: 62J07

arXiv:2109.05622 [pdf, ps, other]

Nimber-Preserving Reductions and Homomorphic Sprague-Grundy Game Encodings

Authors: Kyle Burke, Matthew Ferland, Shanghua Teng

Abstract: The concept of nimbers--a.k.a. Grundy-values or nim-values--is fundamental to combinatorial game theory. Nimbers provide a complete characterization of strategic interactions among impartial games in their disjunctive sums as well as the winnability. In this paper, we initiate a study of nimber-preserving reductions among impartial games. These reductions enhance the winnability-preserving reducti… ▽ More The concept of nimbers--a.k.a. Grundy-values or nim-values--is fundamental to combinatorial game theory. Nimbers provide a complete characterization of strategic interactions among impartial games in their disjunctive sums as well as the winnability. In this paper, we initiate a study of nimber-preserving reductions among impartial games. These reductions enhance the winnability-preserving reductions in traditional computational characterizations of combinatorial games. We prove that Generalized Geography is complete for the natural class, $\cal{I}^P$ , of polynomially-short impartial rulesets under nimber-preserving reductions, a property we refer to as Sprague-Grundy-complete. In contrast, we also show that not every PSPACE-complete ruleset in $\cal{I}^P$ is Sprague-Grundy-complete for $\cal{I}^P$ . By considering every impartial game as an encoding of its nimber, our technical result establishes the following striking cryptography-inspired homomorphic theorem: Despite the PSPACE-completeness of nimber computation for $\cal{I}^P$ , there exists a polynomial-time algorithm to construct, for any pair of games $G_1$, $G_2$ of $\cal{I}^P$ , a prime game (i.e. a game that cannot be written as a sum) $H$ of $\cal{I}^P$ , satisfying: nimber($H$) = nimber($G_1$) $\oplus$ nimber($G_2$). △ Less

Submitted 22 February, 2022; v1 submitted 12 September, 2021; originally announced September 2021.

Comments: 15 Pages

MSC Class: 91A46 ACM Class: F.1.3; F.2.2; G.2.1; G.2.2

arXiv:2109.02516 [pdf, other]

Binomial confidence intervals for rare events: importance of defining margin of error relative to magnitude of proportion

Authors: Owen McGrath, Kevin Burke

Abstract: Confidence interval performance is typically assessed in terms of two criteria: coverage probability and interval width (or margin of error). In this paper, we assess the performance of four common proportion interval estimators: the Wald, Clopper-Pearson (exact), Wilson and Agresti-Coull, in the context of rare-event probabilities. We define the interval precision in terms of a relative margin of… ▽ More Confidence interval performance is typically assessed in terms of two criteria: coverage probability and interval width (or margin of error). In this paper, we assess the performance of four common proportion interval estimators: the Wald, Clopper-Pearson (exact), Wilson and Agresti-Coull, in the context of rare-event probabilities. We define the interval precision in terms of a relative margin of error which ensures consistency with the magnitude of the proportion. Thus, confidence interval estimators are assessed in terms of achieving a desired coverage probability whilst simultaneously satisfying the specified relative margin of error. We illustrate the importance of considering both coverage probability and relative margin of error when estimating rare-event proportions, and show that within this framework, all four interval estimators perform somewhat similarly for a given sample size and confidence level. We identify relative margin of error values that result in satisfactory coverage whilst being conservative in terms of sample size requirements, and hence suggest a range of values that can be adopted in practice. The proposed relative margin of error scheme is evaluated analytically, by simulation, and by application to a number of recent studies from the literature. △ Less

Submitted 15 January, 2024; v1 submitted 6 September, 2021; originally announced September 2021.

Comments: 25 pages, 2 figures

MSC Class: 62F25

arXiv:2108.11534 [pdf, other]

Lies My Teacher Told Me About Density Functional Theory: Seeing Through Them with the Hubbard Dimer

Authors: Kieron Burke, John Kozlowski

Abstract: Most realistic calculations of moderately correlated materials begin with a ground-state density functional theory (DFT) calculation. While Kohn-Sham DFT is used in about 40,000 scientific papers each year, the fundamental underpinnings are not widely appreciated. In this chapter, we analyze the inherent characteristics of DFT in their simplest form, using the asymmetric Hubbard dimer as an illust… ▽ More Most realistic calculations of moderately correlated materials begin with a ground-state density functional theory (DFT) calculation. While Kohn-Sham DFT is used in about 40,000 scientific papers each year, the fundamental underpinnings are not widely appreciated. In this chapter, we analyze the inherent characteristics of DFT in their simplest form, using the asymmetric Hubbard dimer as an illustrative model. We begin by working through the core tenets of DFT, explaining what the exact ground-state density functional yields and does not yield. Given the relative simplicity of the system, almost all properties of the exact exchange-correlation functional are readily visualized and plotted. Key concepts include the Kohn-Sham scheme, the behavior of the XC potential as correlations become very strong, the derivative discontinuity and the difference between KS gaps and true charge gaps, and how to extract optical excitations using time-dependent DFT. By the end of this text and accompanying exercises, the reader will improve their ability to both explain and visualize the concepts of DFT, as well as better understand where others may go wrong. △ Less

Submitted 22 September, 2022; v1 submitted 25 August, 2021; originally announced August 2021.

Comments: This chapter appears in the book "Autumn School on Correlated Electrons: Simulating Correlations with Computers" (2021) prepared by Forschungszentrum Jülich; see http://hdl.handle.net/2128/28665 for published chapter. Updated versions correct citations

arXiv:2106.06377 [pdf, other]

doi 10.1371/journal.pone.0260632

An age-structured SEIR model for COVID--19 incidence in Dublin, Ireland with framework for evaluating health intervention cost

Authors: Fatima-Zahra Jaouimaa, Daniel Dempsey, Suzanne van Osch, Stephen Kinsella, Kevin Burke, Jason Wyse, James Sweeney

Abstract: Strategies adopted globally to mitigate the threat of COVID-19 have primarily involved lockdown measures with substantial economic and social costs with varying degrees of success. Morbidity patterns of COVID-19 variants have a strong association with age, while restrictive lockdown measures have association with negative mental health outcomes in some age groups. Reduced economic prospects may al… ▽ More Strategies adopted globally to mitigate the threat of COVID-19 have primarily involved lockdown measures with substantial economic and social costs with varying degrees of success. Morbidity patterns of COVID-19 variants have a strong association with age, while restrictive lockdown measures have association with negative mental health outcomes in some age groups. Reduced economic prospects may also afflict some age cohorts more than others. Motivated by this, we propose a model to describe COVID-19 community spread incorporating the role of age-specific social interactions. Through a flexible parameterisation of an age-structured deterministic Susceptible Exposed Infectious Removed (SEIR) model, we provide a means for characterising different forms of lockdown which may impact specific age groups differently. Social interactions are represented through age group to age group contact matrices, which can be trained using available data and are thus locally adapted. This framework is easy to interpret and suitable for describing counterfactual scenarios, which could assist policy makers with regard to minimising morbidity balanced with the costs of prospective suppression strategies. Our work originates from an Irish context and we use disease monitoring data from February 29th 2020 to January 31st 2021 gathered by Irish governmental agencies. We demonstrate how Irish lockdown scenarios can be constructed using the proposed model formulation and show results of retrospective fitting to incidence rates and forward planning with relevant ``what if/instead of'' lockdown counterfactuals with uncertainty quantification. Our formulation is agnostic to a specific locale, in that lockdown strategies in other regions can be straightforwardly encoded using this model. The methods we describe are made publicly available online through an accessible and easy to use web interface. △ Less

Submitted 11 June, 2021; originally announced June 2021.

arXiv:2106.02114 [pdf, ps, other]

Winning the War by (Strategically) Losing Battles: Settling the Complexity of Grundy-Values in Undirected Geography

Authors: Kyle Burke, Matthew Ferland, Shanghua Teng

Abstract: We settle two long-standing complexity-theoretical questions-open since 1981 and 1993-in combinatorial game theory (CGT). We prove that the Grundy value (a.k.a. nim-value, or nimber) of Undirected Geography is PSPACE-complete to compute. This exhibits a stark contrast with a result from 1993 that Undirected Geography is polynomial-time solvable. By distilling to a simple reduction, our proof fur… ▽ More We settle two long-standing complexity-theoretical questions-open since 1981 and 1993-in combinatorial game theory (CGT). We prove that the Grundy value (a.k.a. nim-value, or nimber) of Undirected Geography is PSPACE-complete to compute. This exhibits a stark contrast with a result from 1993 that Undirected Geography is polynomial-time solvable. By distilling to a simple reduction, our proof further establishes a dichotomy theorem, providing a "phase transition to intractability" in Grundy-value computation, sharply characterized by a maximum degree of four: The Grundy value of Undirected Geography over any degree-three graph is polynomial-time computable, but over degree-four graphs-even when planar and bipartite-is PSPACE-hard. Additionally, we show, for the first time, how to construct Undirected Geography instances with Grundy value $\ast n$ and size polynomial in n. We strengthen a result from 1981 showing that sums of tractable partisan games are PSPACE-complete in two fundamental ways. First, since Undirected Geography is an impartial ruleset, we extend the hardness of sums to impartial games, a strict subset of partisan. Second, the 1981 construction is not built from a natural ruleset, instead using a long sum of tailored short-depth game positions. We use the sum of two Undirected Geography positions to create our hard instances. Our result also has computational implications to Sprague-Grundy Theory (1930s) which shows that the Grundy value of the disjunctive sum of any two impartial games can be computed-in polynomial time-from their Grundy values. In contrast, we prove that assuming PSPACE $\neq$ P, there is no general polynomial-time method to summarize two polynomial-time solvable impartial games to efficiently solve their disjunctive sum. △ Less

Submitted 3 June, 2021; originally announced June 2021.

MSC Class: 91A46 ACM Class: F.1.3; F.2.2; G.2.1; G.2.2

arXiv:2105.04384 [pdf, other]

Semiclassics: The hidden theory behind the success of DFT

Authors: Pavel Okun, Kieron Burke

Abstract: We argue that the success of DFT can be understood in terms of a semiclassical expansion around a very specific limit. This limit was identified long ago by Lieb and Simon for the total electronic energy of a system. This is a universal limit of all electronic structure: atoms, molecules, and solids. For the total energy, Thomas-Fermi theory becomes relatively exact in the limit. The limit can als… ▽ More We argue that the success of DFT can be understood in terms of a semiclassical expansion around a very specific limit. This limit was identified long ago by Lieb and Simon for the total electronic energy of a system. This is a universal limit of all electronic structure: atoms, molecules, and solids. For the total energy, Thomas-Fermi theory becomes relatively exact in the limit. The limit can also be studied for much simpler model systems, including non-interacting fermions in a one-dimensional well, where the WKB approximation applies for individual eigenvalues and eigenfunctions. Summation techniques lead to energies and densities that are functionals of the potential. We consider several examples in one dimension (fermions in a box, in a harmonic well, in a linear half-well, and in the Pöschl-Teller well. The effects of higher dimension are also illustrated with the three-dimensional harmonic well and the Bohr atom, non-interacting fermions in a Coulomb well. Modern density functional calculations use the Kohn-Sham scheme almost exclusively. The same semiclassical limit can be studied for the Kohn-Sham kinetic energy, for the exchange energy, and for the correlation energy. For all three, the local density approximation appears to become relatively exact in this limit. Recent work, both analytic and numerical, explores how this limit is approached, in an effort to deduce the leading corrections to the local approximation. A simple scheme, using the Euler-Maclaurin summation formula, is the result of many different attempts at this problem. In very simple cases, the correction formulas are much more accurate than standard density functionals. Several functionals are already in widespread use in both chemistry and materials that incorporate these limits, and prospects for the future are discussed. △ Less

Submitted 17 May, 2021; v1 submitted 10 May, 2021; originally announced May 2021.

arXiv:2104.11802 [pdf]

doi 10.3847/PSJ/abfbe2

Characterization of Exogenic Boulders on Near-Earth Asteroid (101955) Bennu from OSIRIS-REx Color Images

Authors: Lucille Le Corre, Vishnu Reddy, William F. Bottke, Daniella N. DellaGiustina, Keara Burke, Jennifer Nolau, Robin B. Van Auken, Dathon R. Golish, Juan Sanchez, Jian-Yang Li, Christian d'Aubigny, Bashar Rizk, Dante Lauretta

Abstract: A small number of anomalously bright boulders on the near-Earth, rubble-pile asteroid (101955) Bennu were recently identified as eucritic material originating from asteroid (4) Vesta. Building on this discovery, we explored the global presence of exogenic boulders on Bennu. Our analysis focused on boulders larger than 1 m that show the characteristic 1-micron pyroxene absorption band in the four-c… ▽ More A small number of anomalously bright boulders on the near-Earth, rubble-pile asteroid (101955) Bennu were recently identified as eucritic material originating from asteroid (4) Vesta. Building on this discovery, we explored the global presence of exogenic boulders on Bennu. Our analysis focused on boulders larger than 1 m that show the characteristic 1-micron pyroxene absorption band in the four-color MapCam data from the OSIRIS-REx mission. We confirm the presence of exogenic boulders similar to eucrites and find that mixtures of eucrites with carbonaceous material is also a possible composition for some boulders. Some of the exogenic boulders have spectral properties similar to those of ordinary chondrite (OC) meteorites, although the laboratory spectra of these meteorites have a higher albedo than those measured on Bennu, which could be explained by either a grain size effect, the presence of impact melt, or optical mixing with carbonaceous material owing to dust coating. Our Monte Carlo simulations predict that the median amount of OC mass added to the parent body of Bennu is 0.055% and 0.037% of the volume of a 100- and 200-km-diameter parent body, respectively. If Bennu was a uniformly mixed byproduct of parent body and S-type projectiles, the equivalent mass of OC material would be a sphere with a diameter of 36 to 40 m (or a volume of 24,200 to 33,600 m3). The total amount of OC material in the interior of Bennu estimated from the MapCam data is slightly higher (91,000-150,000 m3). △ Less

Submitted 23 April, 2021; originally announced April 2021.

Comments: 36 pages, 11 figures, 2 tables

Journal ref: Planet. Sci. J. 2 114 (2021)

arXiv:2103.03417 [pdf, other]

doi 10.1145/3461702.3462557

Measuring Model Biases in the Absence of Ground Truth

Authors: Osman Aka, Ken Burke, Alex Bäuerle, Christina Greer, Margaret Mitchell

Abstract: The measurement of bias in machine learning often focuses on model performance across identity subgroups (such as man and woman) with respect to groundtruth labels. However, these methods do not directly measure the associations that a model may have learned, for example between labels and identity subgroups. Further, measuring a model's bias requires a fully annotated evaluation dataset which may… ▽ More The measurement of bias in machine learning often focuses on model performance across identity subgroups (such as man and woman) with respect to groundtruth labels. However, these methods do not directly measure the associations that a model may have learned, for example between labels and identity subgroups. Further, measuring a model's bias requires a fully annotated evaluation dataset which may not be easily available in practice. We present an elegant mathematical solution that tackles both issues simultaneously, using image classification as a working example. By treating a classification model's predictions for a given image as a set of labels analogous to a bag of words, we rank the biases that a model has learned with respect to different identity labels. We use (man, woman) as a concrete example of an identity label set (although this set need not be binary), and present rankings for the labels that are most biased towards one identity or the other. We demonstrate how the statistical properties of different association metrics can lead to different rankings of the most "gender biased" labels, and conclude that normalized pointwise mutual information (nPMI) is most useful in practice. Finally, we announce an open-sourced nPMI visualization tool using TensorBoard. △ Less

Submitted 6 June, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

arXiv:2102.06842 [pdf, other]

Explaining and Fixing DFT Failures for Torsional Barriers

Authors: Seungsoo Nam, Eunbyol Cho, Kieron Burke, Eunji Sim

Abstract: Most torsional barriers are predicted to high accuracy (about 1kJ/mol) by standard semilocal functionals, but a small subset has been found to have much larger errors. We create a database of almost 300 carbon-carbon torsional barriers, including 12 poorly behaved barriers, all stemming from Y=C-X group, where X is O or S, and Y is a halide. Functionals with enhanced exchange mixing (about 50%) wo… ▽ More Most torsional barriers are predicted to high accuracy (about 1kJ/mol) by standard semilocal functionals, but a small subset has been found to have much larger errors. We create a database of almost 300 carbon-carbon torsional barriers, including 12 poorly behaved barriers, all stemming from Y=C-X group, where X is O or S, and Y is a halide. Functionals with enhanced exchange mixing (about 50%) work well for all barriers. We find that poor actors have delocalization errors caused by hyperconjugation. These problematic calculations are density sensitive (i.e., DFT predictions change noticeably with the density), and using HF densities (HF-DFT) fixes these issues. For example, conventional B3LYP performs as accurately as exchange-enhanced functionals if the HF density is used. For long-chain conjugated molecules, HF-DFT can be much better than exchange-enhanced functionals. We suggest that HF-PBE0 has the best overall performance. △ Less

Submitted 12 February, 2021; originally announced February 2021.

Comments: 13 pages, 7 main figures, 6 supporting figures

arXiv:2101.07237 [pdf, ps, other]

Transverse Wave: an impartial color-propagation game inspired by Social Influence and Quantum Nim

Authors: Kyle Burke, Matthew Ferland, Shanghua Teng

Abstract: In this paper, we study a colorful, impartial combinatorial game played on a two-dimensional grid, Transverse Wave. We are drawn to this game because of its apparent simplicity, contrasting intractability, and intrinsic connection to two other combinatorial games, one inspired by social influence and another inspired by quantum superpositions. More precisely, we show that Transverse Wave is at t… ▽ More In this paper, we study a colorful, impartial combinatorial game played on a two-dimensional grid, Transverse Wave. We are drawn to this game because of its apparent simplicity, contrasting intractability, and intrinsic connection to two other combinatorial games, one inspired by social influence and another inspired by quantum superpositions. More precisely, we show that Transverse Wave is at the intersection of social-influence-inspired Friend Circle and superposition-based Demi-Quantum Nim. Transverse Wave is also connected with Schaefer's logic game Avoid True. In addition to analyzing the mathematical structures and computational complexity of Transverse Wave, we provide a web-based version of the game, playable at https://turing.plymouth.edu/~kgb1013/DB/combGames/transverseWave.html. Furthermore, we formulate a basic network-influence inspired game, called Demographic Influence, which simultaneously generalizes Node-Kyles and Demi-Quantum Nim (which in turn contains as special cases Nim, Avoid True, and Transverse Wave). These connections illuminate the lattice order, induced by special-case/generalization relationships over mathematical games, fundamental to both the design and comparative analyses of combinatorial games. △ Less

Submitted 19 January, 2021; v1 submitted 18 January, 2021; originally announced January 2021.

Comments: 29 Pages

MSC Class: 91A46 ACM Class: G.2.1; G.2.2; F.1.3

Showing 1–50 of 200 results for author: Burke, K