-
Simulating the blood transfusion system in Kenya: Modelling methods and exploratory analyses
Authors:
Yiqi Tian,
Bo Zeng,
Jana MacLeod,
Gatwiri Murithi,
Cindy M. Makanga,
Hillary Barmasai,
Linda Barnes,
Rahul S. Bidanda,
Tonny Ejilkon Epuu,
Robert Kamu Kaburu,
Tecla Chelagat,
Jason Madan,
Jennifer Makin,
Alejandro Munoz-Valencia,
Carolyne Njoki,
Kevin Ochieng,
Bernard Olayo,
Jose Paiz,
Kristina E. Rudd,
Mark Yazer,
Juan Carlos Puyana,
Bopaya Bidanda,
Jayant Rajgopal,
Pratap Kumar
Abstract:
The process of collecting blood from donors and making it available for transfusion requires a complex series of operations involving multiple actors and resources at each step. Ensuring hospitals receive adequate and safe blood for transfusion is a common challenge across low- and middle-income countries, but is rarely addressed from a system level. This paper presents the first use of discrete e…
▽ More
The process of collecting blood from donors and making it available for transfusion requires a complex series of operations involving multiple actors and resources at each step. Ensuring hospitals receive adequate and safe blood for transfusion is a common challenge across low- and middle-income countries, but is rarely addressed from a system level. This paper presents the first use of discrete event simulation to study the blood system in Kenya and to explore the effect of variations and perturbations at different steps of the system on meeting patient blood demand. A process map of the Kenyan blood system was developed to capture critical steps from blood donation to transfusion using interviews with blood bank, hospital, and laboratory personnel at four public hospitals across three counties in Kenya. The blood system was simulated starting with blood collection, a blood bank where blood is tested and stored before it is issued, a major hospital attached to the blood bank, and several smaller hospitals served by the same blood bank. Values for supply-side parameters were based mainly on expert opinion; demand-side parameters were based on data from blood requisitions made in hospital wards, and dispatch of blood from the hospital laboratory. Illustrative examples demonstrate how the model can be used to explore the impacts of changes in blood collection (e.g., prioritising different donor types), blood demand (e.g., differing clinical case mix), and blood distribution (e.g., restocking strategies) on meeting demand at patient level. The model can reveal potential process impediments in the blood system and aid in choosing strategies for improving blood collection, distribution or use. Such a systems approach allows for interventions at different steps in the blood continuum to be tested on blood availability for different patients presenting at diverse hospitals across the country.
△ Less
Submitted 9 October, 2024;
originally announced October 2024.
-
Improving Speech Decoding from ECoG with Self-Supervised Pretraining
Authors:
Brian A. Yuan,
Joseph G. Makin
Abstract:
Recent work on intracranial brain-machine interfaces has demonstrated that spoken speech can be decoded with high accuracy, essentially by treating the problem as an instance of supervised learning and training deep neural networks to map from neural activity to text. However, such networks pay for their expressiveness with very large numbers of labeled data, a requirement that is particularly bur…
▽ More
Recent work on intracranial brain-machine interfaces has demonstrated that spoken speech can be decoded with high accuracy, essentially by treating the problem as an instance of supervised learning and training deep neural networks to map from neural activity to text. However, such networks pay for their expressiveness with very large numbers of labeled data, a requirement that is particularly burdensome for invasive neural recordings acquired from human patients. On the other hand, these patients typically produce speech outside of the experimental blocks used for training decoders. Making use of such data, and data from other patients, to improve decoding would ease the burden of data collection -- especially onerous for dys- and anarthric patients. Here we demonstrate that this is possible, by reengineering wav2vec -- a simple, self-supervised, fully convolutional model that learns latent representations of audio using a noise-contrastive loss -- for electrocorticographic (ECoG) data. We train this model on unlabelled ECoG recordings, and subsequently use it to transform ECoG from labeled speech sessions into wav2vec's representation space, before finally training a supervised encoder-decoder to map these representations to text. We experiment with various numbers of labeled blocks; for almost all choices, the new representations yield superior decoding performance to the original ECoG data, and in no cases do they yield worse. Performance can also be improved in some cases by pretraining wav2vec on another patient's data. In the best cases, wav2vec's representations decrease word error rates over the original data by upwards of 50%.
△ Less
Submitted 28 May, 2024;
originally announced May 2024.
-
Learning Recurrent Models with Temporally Local Rules
Authors:
Azwar Abdulsalam,
Joseph G. Makin
Abstract:
Fitting generative models to sequential data typically involves two recursive computations through time, one forward and one backward. The latter could be a computation of the loss gradient (as in backpropagation through time), or an inference algorithm (as in the RTS/Kalman smoother). The backward pass in particular is computationally expensive (since it is inherently serial and cannot exploit GP…
▽ More
Fitting generative models to sequential data typically involves two recursive computations through time, one forward and one backward. The latter could be a computation of the loss gradient (as in backpropagation through time), or an inference algorithm (as in the RTS/Kalman smoother). The backward pass in particular is computationally expensive (since it is inherently serial and cannot exploit GPUs), and difficult to map onto biological processes. Work-arounds have been proposed; here we explore a very different one: requiring the generative model to learn the joint distribution over current and previous states, rather than merely the transition probabilities. We show on toy datasets that different architectures employing this principle can learn aspects of the data typically requiring the backward pass.
△ Less
Submitted 20 October, 2023;
originally announced October 2023.
-
Inferring Population Dynamics in Macaque Cortex
Authors:
Ganga Meghanath,
Bryan Jimenez,
Joseph G. Makin
Abstract:
The proliferation of multi-unit cortical recordings over the last two decades, especially in macaques and during motor-control tasks, has generated interest in neural "population dynamics": the time evolution of neural activity across a group of neurons working together. A good model of these dynamics should be able to infer the activity of unobserved neurons within the same population and of the…
▽ More
The proliferation of multi-unit cortical recordings over the last two decades, especially in macaques and during motor-control tasks, has generated interest in neural "population dynamics": the time evolution of neural activity across a group of neurons working together. A good model of these dynamics should be able to infer the activity of unobserved neurons within the same population and of the observed neurons at future times. Accordingly, Pandarinath and colleagues have introduced a benchmark to evaluate models on these two (and related) criteria: four data sets, each consisting of firing rates from a population of neurons, recorded from macaque cortex during movement-related tasks. Here we show that simple, general-purpose architectures based on recurrent neural networks (RNNs) outperform more "bespoke" models, and indeed outperform all published models on all four data sets in the benchmark. Performance can be improved further still with a novel, hybrid architecture that augments the RNN with self-attention, as in transformer networks. But pure transformer models fail to achieve this level of performance, either in our work or that of other groups. We argue that the autoregressive bias imposed by RNNs is critical for achieving the highest levels of performance. We conclude, however, by proposing that the benchmark be augmented with an alternative evaluation of latent dynamics that favors generative over discriminative models like the ones we propose in this report.
△ Less
Submitted 19 October, 2023; v1 submitted 5 April, 2023;
originally announced April 2023.
-
An Introduction to Modern Statistical Learning
Authors:
Joseph G. Makin
Abstract:
This work in progress aims to provide a unified introduction to statistical learning, building up slowly from classical models like the GMM and HMM to modern neural networks like the VAE and diffusion models. There are today many internet resources that explain this or that new machine-learning algorithm in isolation, but they do not (and cannot, in so brief a space) connect these algorithms with…
▽ More
This work in progress aims to provide a unified introduction to statistical learning, building up slowly from classical models like the GMM and HMM to modern neural networks like the VAE and diffusion models. There are today many internet resources that explain this or that new machine-learning algorithm in isolation, but they do not (and cannot, in so brief a space) connect these algorithms with each other or with the classical literature on statistical models, out of which the modern algorithms emerged. Also conspicuously lacking is a single notational system which, although unfazing to those already familiar with the material (like the authors of these posts), raises a significant barrier to the novice's entry. Likewise, I have aimed to assimilate the various models, wherever possible, to a single framework for inference and learning, showing how (and why) to change one model into another with minimal alteration (some of them novel, others from the literature).
Some background is of course necessary. I have assumed the reader is familiar with basic multivariable calculus, probability and statistics, and linear algebra. The goal of this book is certainly not completeness, but rather to draw a more or less straight-line path from the basics to the extremely powerful new models of the last decade. The goal then is to complement, not replace, such comprehensive texts as Bishop's \emph{Pattern Recognition and Machine Learning}, which is now 15 years old.
△ Less
Submitted 20 July, 2022;
originally announced July 2022.
-
Recurrent Exponential-Family Harmoniums without Backprop-Through-Time
Authors:
Joseph G. Makin,
Benjamin K. Dichter,
Philip N. Sabes
Abstract:
Exponential-family harmoniums (EFHs), which extend restricted Boltzmann machines (RBMs) from Bernoulli random variables to other exponential families (Welling et al., 2005), are generative models that can be trained with unsupervised-learning techniques, like contrastive divergence (Hinton et al. 2006; Hinton, 2002), as density estimators for static data. Methods for extending RBMs--and likewise E…
▽ More
Exponential-family harmoniums (EFHs), which extend restricted Boltzmann machines (RBMs) from Bernoulli random variables to other exponential families (Welling et al., 2005), are generative models that can be trained with unsupervised-learning techniques, like contrastive divergence (Hinton et al. 2006; Hinton, 2002), as density estimators for static data. Methods for extending RBMs--and likewise EFHs--to data with temporal dependencies have been proposed previously (Sutskever and Hinton, 2007; Sutskever et al., 2009), the learning procedure being validated by qualitative assessment of the generative model. Here we propose and justify, from a very different perspective, an alternative training procedure, proving sufficient conditions for optimal inference under that procedure. The resulting algorithm can be learned with only forward passes through the data--backprop-through-time is not required, as in previous approaches. The proof exploits a recent result about information retention in density estimators (Makin and Sabes, 2015), and applies it to a "recurrent EFH" (rEFH) by induction. Finally, we demonstrate optimality by simulation, testing the rEFH: (1) as a filter on training data generated with a linear dynamical system, the position of which is noisily reported by a population of "neurons" with Poisson-distributed spike counts; and (2) with the qualitative experiments proposed by Sutskever et al. (2009).
△ Less
Submitted 18 May, 2016;
originally announced May 2016.