-
Sequential Inference of Hospitalization Electronic Health Records Using Probabilistic Models
Authors:
Alan D. Kaplan,
Priyadip Ray,
John D. Greene,
Vincent X. Liu
Abstract:
In the dynamic hospital setting, decision support can be a valuable tool for improving patient outcomes. Data-driven inference of future outcomes is challenging in this dynamic setting, where long sequences such as laboratory tests and medications are updated frequently. This is due in part to heterogeneity of data types and mixed-sequence types contained in variable length sequences. In this work…
▽ More
In the dynamic hospital setting, decision support can be a valuable tool for improving patient outcomes. Data-driven inference of future outcomes is challenging in this dynamic setting, where long sequences such as laboratory tests and medications are updated frequently. This is due in part to heterogeneity of data types and mixed-sequence types contained in variable length sequences. In this work we design a probabilistic unsupervised model for multiple arbitrary-length sequences contained in hospitalization Electronic Health Record (EHR) data. The model uses a latent variable structure and captures complex relationships between medications, diagnoses, laboratory tests, neurological assessments, and medications. It can be trained on original data, without requiring any lossy transformations or time binning. Inference algorithms are derived that use partial data to infer properties of the complete sequences, including their length and presence of specific values. We train this model on data from subjects receiving medical care in the Kaiser Permanente Northern California integrated healthcare delivery system. The results are evaluated against held-out data for predicting the length of sequences and presence of Intensive Care Unit (ICU) in hospitalization bed sequences. Our method outperforms a baseline approach, showing that in these experiments the trained model captures information in the sequences that is informative of their future values.
△ Less
Submitted 24 April, 2024; v1 submitted 27 March, 2024;
originally announced March 2024.
-
Walsh coefficients and circuits for several alleles
Authors:
Kristina Crona,
Devin Greene
Abstract:
Walsh coefficients have been applied extensively to biallelic systems for quantifying pairwise and higher order epistasis, in particular for demonstrating the empirical importance of higher order interactions. Circuits, or minimal dependence relations, and related approaches that use triangulations of polytopes have also been applied to biallelic systems. Here we provide biological interpretations…
▽ More
Walsh coefficients have been applied extensively to biallelic systems for quantifying pairwise and higher order epistasis, in particular for demonstrating the empirical importance of higher order interactions. Circuits, or minimal dependence relations, and related approaches that use triangulations of polytopes have also been applied to biallelic systems. Here we provide biological interpretations of Walsh coefficients for several alleles, and discuss circuits in the same general setting.
△ Less
Submitted 2 January, 2024; v1 submitted 1 January, 2024;
originally announced January 2024.
-
Multiallelic Walsh transforms
Authors:
Devin Greene
Abstract:
A closed formula multiallelic Walsh (or Hadamard) transform is introduced. Basic results are derived, and a statistical interpretation of some of the resulting linear forms is discussed.
A closed formula multiallelic Walsh (or Hadamard) transform is introduced. Basic results are derived, and a statistical interpretation of some of the resulting linear forms is discussed.
△ Less
Submitted 28 November, 2023;
originally announced November 2023.
-
A Primer for the Walsh Transform
Authors:
Devin Greene
Abstract:
A mathematical development of the Walsh transform, Walsh basis, and Walsh coefficients is given. The author was prompted to write this by a wish to give a unified treatment of epistatic coordinates as they are used in evolutionary biology. At the end of the article, opinions are expressed regarding the usefulness of these concepts for the practical researcher.
A mathematical development of the Walsh transform, Walsh basis, and Walsh coefficients is given. The author was prompted to write this by a wish to give a unified treatment of epistatic coordinates as they are used in evolutionary biology. At the end of the article, opinions are expressed regarding the usefulness of these concepts for the practical researcher.
△ Less
Submitted 3 November, 2023;
originally announced November 2023.
-
Unsupervised Probabilistic Models for Sequential Electronic Health Records
Authors:
Alan D. Kaplan,
John D. Greene,
Vincent X. Liu,
Priyadip Ray
Abstract:
We develop an unsupervised probabilistic model for heterogeneous Electronic Health Record (EHR) data. Utilizing a mixture model formulation, our approach directly models sequences of arbitrary length, such as medications and laboratory results. This allows for subgrouping and incorporation of the dynamics underlying heterogeneous data types. The model consists of a layered set of latent variables…
▽ More
We develop an unsupervised probabilistic model for heterogeneous Electronic Health Record (EHR) data. Utilizing a mixture model formulation, our approach directly models sequences of arbitrary length, such as medications and laboratory results. This allows for subgrouping and incorporation of the dynamics underlying heterogeneous data types. The model consists of a layered set of latent variables that encode underlying structure in the data. These variables represent subject subgroups at the top layer, and unobserved states for sequences in the second layer. We train this model on episodic data from subjects receiving medical care in the Kaiser Permanente Northern California integrated healthcare delivery system. The resulting properties of the trained model generate novel insight from these complex and multifaceted data. In addition, we show how the model can be used to analyze sequences that contribute to assessment of mortality likelihood.
△ Less
Submitted 31 August, 2022; v1 submitted 14 April, 2022;
originally announced April 2022.
-
Modeling sepsis progression using hidden Markov models
Authors:
Brenden K. Petersen,
Michael B. Mayhew,
Kalvin O. E. Ogbuefi,
John D. Greene,
Vincent X. Liu,
Priyadip Ray
Abstract:
Characterizing a patient's progression through stages of sepsis is critical for enabling risk stratification and adaptive, personalized treatment. However, commonly used sepsis diagnostic criteria fail to account for significant underlying heterogeneity, both between patients as well as over time in a single patient. We introduce a hidden Markov model of sepsis progression that explicitly accounts…
▽ More
Characterizing a patient's progression through stages of sepsis is critical for enabling risk stratification and adaptive, personalized treatment. However, commonly used sepsis diagnostic criteria fail to account for significant underlying heterogeneity, both between patients as well as over time in a single patient. We introduce a hidden Markov model of sepsis progression that explicitly accounts for patient heterogeneity. Benchmarked against two sepsis diagnostic criteria, the model provides a useful tool to uncover a patient's latent sepsis trajectory and to identify high-risk patients in whom more aggressive therapy may be indicated.
△ Less
Submitted 8 January, 2018;
originally announced January 2018.
-
Computational Analysis for the Rational Design of Anti-Amyloid Beta (ABeta) Antibodies
Authors:
D'Artagnan Greene,
Theodora Po,
Jennifer Pan,
Tanya Tabibian,
Ray Luo
Abstract:
Alzheimer's Disease (AD) is a neurodegenerative disorder that lacks effective treatment options. Anti-amyloid beta (ABeta) antibodies are the leading drug candidates to treat AD, but the results of clinical trials have been disappointing. Introducing rational mutations into anti-ABeta antibodies to increase their effectiveness is a way forward, but the path to take is unclear. In this study, we de…
▽ More
Alzheimer's Disease (AD) is a neurodegenerative disorder that lacks effective treatment options. Anti-amyloid beta (ABeta) antibodies are the leading drug candidates to treat AD, but the results of clinical trials have been disappointing. Introducing rational mutations into anti-ABeta antibodies to increase their effectiveness is a way forward, but the path to take is unclear. In this study, we demonstrate the use of computational fragment-based docking and MMPBSA binding free energy calculations in the analysis of anti-ABeta antibodies for rational drug design efforts. Our fragment-based docking method successfully predicted the emergence of the common EFRH epitope, MD simulations coupled with MMPBSA binding free energy calculations were used to analyze scenarios described in prior studies, and we introduced rational mutations into PFA1 to improve its calculated binding affinity towards the pE3-ABeta3-8 form of ABeta. Two out of four proposed mutations stabilized binding. Our study demonstrates that a computational approach may lead to an improved drug candidate for AD in the future.
△ Less
Submitted 22 February, 2018; v1 submitted 4 January, 2018;
originally announced January 2018.
-
Rational Design of Antibiotic Treatment Plans
Authors:
Portia M. Mira,
Kristina Crona,
Devin Greene,
Juan C. Meza,
Bernd Sturmfels,
Miriam Barlow
Abstract:
The development of reliable methods for restoring susceptibility after antibiotic resistance arises has proven elusive. A greater understanding of the relationship between antibiotic administration and the evolution of resistance is key to overcoming this challenge. Here we present a data-driven mathematical approach for developing antibiotic treatment plans that can reverse the evolution of antib…
▽ More
The development of reliable methods for restoring susceptibility after antibiotic resistance arises has proven elusive. A greater understanding of the relationship between antibiotic administration and the evolution of resistance is key to overcoming this challenge. Here we present a data-driven mathematical approach for developing antibiotic treatment plans that can reverse the evolution of antibiotic resistance determinants. We have generated adaptive landscapes for 16 genotypes of the TEM beta-lactamase that vary from the wild type genotype TEM-1 through all combinations of four amino acid substitutions. We determined the growth rate of each genotype when treated with each of 15 beta-lactam antibiotics. By using growth rates as a measure of fitness, we computed the probability of each amino acid substitution in each beta-lactam treatment using two different models named the Correlated Probability Model (CPM) and the Equal Probability Model (EPM). We then performed an exhaustive search through the 15 treatments for substitution paths leading from each of the 16 genotypes back to the wild type TEM-1. We identified those treatment paths that returned the highest probabilities of selecting for reversions of amino acid substitutions and returning TEM to the wild type state. For the CPM model, the optimized probabilities ranged between 0.6 and 1.0. For the EPM model, the optimized probabilities ranged between 0.38 and 1.0. For cyclical CPM treatment plans in which the starting and ending genotype was the wild type, the probabilities were between 0.62 and 0.7. Overall this study shows that there is promise for reversing the evolution of resistance through antibiotic treatment plans.
△ Less
Submitted 5 June, 2014;
originally announced June 2014.
-
The Changing Geometry of a Fitness Landscape Along an Adaptive Walk
Authors:
Devin Greene,
Kristina Crona
Abstract:
It has recently been noted that the relative prevalence of the various kinds of epistasis varies along an adaptive walk. This has been explained as a result of mean regression in NK model fitness landscapes. Here we show that this phenomenon occurs quite generally in fitness landscapes. We propose a simple and general explanation for this phenomemon, confirming the role of mean regression. We prov…
▽ More
It has recently been noted that the relative prevalence of the various kinds of epistasis varies along an adaptive walk. This has been explained as a result of mean regression in NK model fitness landscapes. Here we show that this phenomenon occurs quite generally in fitness landscapes. We propose a simple and general explanation for this phenomemon, confirming the role of mean regression. We provide support for this explanation with simulations, and discuss the empirical relevance of our findings.
△ Less
Submitted 14 December, 2013; v1 submitted 7 July, 2013;
originally announced July 2013.
-
Evolutionary Predictability and Complications with Additivity
Authors:
Kristina Crona,
Devin Greene,
Miriam Barlow
Abstract:
Adaptation is a central topic in theoretical biology, of practical importance for analyzing drug resistance mutations. Several authors have used arguments based on extreme value theory in their work on adaptation. There are complications with these approaches if fitness is additive (meaning that fitness effects of mutations sum), or whenever there is more additivity than what one would expect in a…
▽ More
Adaptation is a central topic in theoretical biology, of practical importance for analyzing drug resistance mutations. Several authors have used arguments based on extreme value theory in their work on adaptation. There are complications with these approaches if fitness is additive (meaning that fitness effects of mutations sum), or whenever there is more additivity than what one would expect in an uncorrelated fitness landscape. However, the approaches have been used in published work, even in situations with substantial amounts of additivity. In particular, extreme value theory has been used in discussions on evolutionary predictability. We say that evolution is predictable if the use of a particular drug at different locations tends lead to the same resistance mutations. Evolutionary predictability depends on the probabilities of mutational trajectories. Arguments about probabilities based on extreme value theory can be misleading. Additivity may cause errors in estimates of the probabilities of some mutational trajectories by a factor 20 even for rather small examples. We show that additivity gives systematic errors so as to exaggerate the differences between the most and the least likely trajectory. As a result of this bias, evolution may appear more predictable than it is. From a broader perspective, our results suggest that approaches which depend on the Orr-Gillespie theory are likely to give misleading results for realistic fitness landscapes whenever one considers adaptation in several steps.
△ Less
Submitted 14 December, 2013; v1 submitted 27 May, 2013;
originally announced May 2013.
-
Antibiotic resistance landscapes: a quantification of theory-data incompatibility for fitness landscapes
Authors:
Kristina Crona,
Dayonna Patterson,
Kelly Stack,
Devin Greene,
Christiane Goulart,
Mentar Mahmudi,
Stephen D. Jacobs,
Marcelo Kallman,
Miriam Barlow
Abstract:
Fitness landscapes are central in analyzing evolution, in particular for drug resistance mutations for bacteria and virus. We show that the fitness landscapes associated with antibiotic resistance are not compatible with any of the classical models; additive, uncorrelated and block fitness landscapes. The NK model is also discussed. It is frequently stated that virtually nothing is known about fit…
▽ More
Fitness landscapes are central in analyzing evolution, in particular for drug resistance mutations for bacteria and virus. We show that the fitness landscapes associated with antibiotic resistance are not compatible with any of the classical models; additive, uncorrelated and block fitness landscapes. The NK model is also discussed. It is frequently stated that virtually nothing is known about fitness landscapes in nature. We demonstrate that available records of antimicrobial drug mutations can reveal interesting properties of fitness landscapes in general. We apply the methods to analyze the TEM family of $β$-lactamases associated with antibiotic resistance. Laboratory results agree with our observations. The qualitative tools we suggest are well suited for comparisons of empirical fitness landscapes. Fitness landscapes are central in the theory of recombination and there is a potential for finding relations between the tools and recombination strategies.
△ Less
Submitted 15 March, 2013;
originally announced March 2013.
-
Designing antibiotic cycling strategies by determining and understanding local adaptive landscapes
Authors:
Christiane P. Goulart,
Mentar Mahmudi,
Kristina A. Crona,
Stephen D. Jacobs,
Marcelo Kallmann,
Barry G. Hall,
Devin C. Greene,
Miriam Barlow
Abstract:
The evolution of antibiotic resistance among bacteria threatens our continued ability to treat infectious diseases. The need for sustainable strategies to cure bacterial infections has never been greater. So far, all attempts to restore susceptibility after resistance has arisen have been unsuccessful, including restrictions on prescribing [1] and antibiotic cycling [2,3]. Part of the problem may…
▽ More
The evolution of antibiotic resistance among bacteria threatens our continued ability to treat infectious diseases. The need for sustainable strategies to cure bacterial infections has never been greater. So far, all attempts to restore susceptibility after resistance has arisen have been unsuccessful, including restrictions on prescribing [1] and antibiotic cycling [2,3]. Part of the problem may be that those efforts have implemented different classes of unrelated antibiotics, and relied on removal of resistance by random loss of resistance genes from bacterial populations (drift). Here, we show that alternating structurally similar antibiotics can restore susceptibility to antibiotics after resistance has evolved. We found that the resistance phenotypes conferred by variant alleles of the resistance gene encoding the TEM β-lactamase (blaTEM) varied greatly among 15 different β-lactam antibiotics. We captured those differences by characterizing complete adaptive landscapes for the resistance alleles blaTEM-50 and blaTEM-85, each of which differs from its ancestor blaTEM-1 by four mutations. We identified pathways through those landscapes where selection for increased resistance moved in a repeating cycle among a limited set of alleles as antibiotics were alternated. Our results showed that susceptibility to antibiotics can be sustainably renewed by cycling structurally similar antibiotics. We anticipate that these results may provide a conceptual framework for managing antibiotic resistance. This approach may also guide sustainable cycling of the drugs used to treat malaria and HIV.
△ Less
Submitted 23 January, 2013;
originally announced January 2013.