-
A Machine Learning Early Warning System: Multicenter Validation in Brazilian Hospitals
Authors:
Jhonatan Kobylarz,
Henrique D. P. dos Santos,
Felipe Barletta,
Mateus Cichelero da Silva,
Renata Vieira,
Hugo M. P. Morales,
Cristian da Costa Rocha
Abstract:
Early recognition of clinical deterioration is one of the main steps for reducing inpatient morbidity and mortality. The challenging task of clinical deterioration identification in hospitals lies in the intense daily routines of healthcare practitioners, in the unconnected patient data stored in the Electronic Health Records (EHRs) and in the usage of low accuracy scores. Since hospital wards are…
▽ More
Early recognition of clinical deterioration is one of the main steps for reducing inpatient morbidity and mortality. The challenging task of clinical deterioration identification in hospitals lies in the intense daily routines of healthcare practitioners, in the unconnected patient data stored in the Electronic Health Records (EHRs) and in the usage of low accuracy scores. Since hospital wards are given less attention compared to the Intensive Care Unit, ICU, we hypothesized that when a platform is connected to a stream of EHR, there would be a drastic improvement in dangerous situations awareness and could thus assist the healthcare team. With the application of machine learning, the system is capable to consider all patient's history and through the use of high-performing predictive models, an intelligent early warning system is enabled. In this work we used 121,089 medical encounters from six different hospitals and 7,540,389 data points, and we compared popular ward protocols with six different scalable machine learning methods (three are classic machine learning models, logistic and probabilistic-based models, and three gradient boosted models). The results showed an advantage in AUC (Area Under the Receiver Operating Characteristic Curve) of 25 percentage points in the best Machine Learning model result compared to the current state-of-the-art protocols. This is shown by the generalization of the algorithm with leave-one-group-out (AUC of 0.949) and the robustness through cross-validation (AUC of 0.961). We also perform experiments to compare several window sizes to justify the use of five patient timestamps. A sample dataset, experiments, and code are available for replicability purposes.
△ Less
Submitted 9 June, 2020;
originally announced June 2020.
-
Online state and parameter estimation in Dynamic Generalised Linear Models
Authors:
Rui Vieira,
Darren J. Wilkinson
Abstract:
Inference for streaming time-series is tightly coupled with the problem of Bayesian on-line state and parameter inference. In this paper we will introduce Dynamic Generalised Linear Models, the class of models often chosen to model continuous and discrete time-series data. We will look at three different approaches which allow on-line estimation and analyse the results when applied to different re…
▽ More
Inference for streaming time-series is tightly coupled with the problem of Bayesian on-line state and parameter inference. In this paper we will introduce Dynamic Generalised Linear Models, the class of models often chosen to model continuous and discrete time-series data. We will look at three different approaches which allow on-line estimation and analyse the results when applied to different real world datasets related to inference for streaming data. Sufficient statistics based methods delay known problems, such as particle impoverishment, especially when applied to long running time-series, while providing reasonable parameter estimations when compared to exact methods, such as Particle Marginal Metropolis-Hastings. State and observation forecasts will also be analysed as a performance metric. By benchmarking against a "gold standard" (off-line) method, we can better understand the performance of on-line methods in challenging real-world scenarios.
△ Less
Submitted 30 August, 2016;
originally announced August 2016.
-
Bus Travel Time Predictions Using Additive Models
Authors:
Matthias Kormaksson,
Luciano Barbosa,
Marcos R. Vieira,
Bianca Zadrozny
Abstract:
Many factors can affect the predictability of public bus services such as traffic, weather and local events. Other aspects, such as day of week or hour of day, may influence bus travel times as well, either directly or in conjunction with other variables. However, the exact nature of such relationships between travel times and predictor variables is, in most situations, not known. In this paper we…
▽ More
Many factors can affect the predictability of public bus services such as traffic, weather and local events. Other aspects, such as day of week or hour of day, may influence bus travel times as well, either directly or in conjunction with other variables. However, the exact nature of such relationships between travel times and predictor variables is, in most situations, not known. In this paper we develop a framework that allows for flexible modeling of bus travel times through the use of Additive Models. In particular, we model travel times as a sum of linear as well as nonlinear terms that are modeled as smooth functions of predictor variables. The proposed class of models provides a principled statistical framework that is highly flexible in terms of model building. The experimental results demonstrate uniformly superior performance of our best model as compared to previous prediction methods when applied to a very large GPS data set obtained from buses operating in the city of Rio de Janeiro.
△ Less
Submitted 28 November, 2014;
originally announced November 2014.