-
Towards Foundation Models for Critical Care Time Series
Authors:
Manuel Burger,
Fedor Sergeev,
Malte Londschien,
Daphné Chopard,
Hugo Yèche,
Eike Gerdes,
Polina Leshetkina,
Alexander Morgenroth,
Zeynep Babür,
Jasmina Bogojeska,
Martin Faltys,
Rita Kuznetsova,
Gunnar Rätsch
Abstract:
Notable progress has been made in generalist medical large language models across various healthcare areas. However, large-scale modeling of in-hospital time series data - such as vital signs, lab results, and treatments in critical care - remains underexplored. Existing datasets are relatively small, but combining them can enhance patient diversity and improve model robustness. To effectively uti…
▽ More
Notable progress has been made in generalist medical large language models across various healthcare areas. However, large-scale modeling of in-hospital time series data - such as vital signs, lab results, and treatments in critical care - remains underexplored. Existing datasets are relatively small, but combining them can enhance patient diversity and improve model robustness. To effectively utilize these combined datasets for large-scale modeling, it is essential to address the distribution shifts caused by varying treatment policies, necessitating the harmonization of treatment variables across the different datasets. This work aims to establish a foundation for training large-scale multi-variate time series models on critical care data and to provide a benchmark for machine learning models in transfer learning across hospitals to study and address distribution shift challenges. We introduce a harmonized dataset for sequence modeling and transfer learning research, representing the first large-scale collection to include core treatment variables. Future plans involve expanding this dataset to support further advancements in transfer learning and the development of scalable, generalizable models for critical healthcare applications.
△ Less
Submitted 25 November, 2024;
originally announced November 2024.
-
WiNNbeta: Batch and drift correction method by white noise normalization for metabolomic studies
Authors:
Olga Demler,
Franco Giulianini,
Yanyan Liu,
Malte Londschien,
Anja Sjöström,
Tanmay Tanna,
Heike Luttmann-Gibson,
Antoine Jeanrenaud
Abstract:
We developed a method called batch and drift correction method by White Noise Normalization (WiNNbeta) to correct individual metabolites for batch effects and drifts. This method tests for white noise properties to identify metabolites in need of correction and corrects them by using fine-tuned splines. To test the method performance we applied WiNNbeta to LC-MS data from our metabolomic studies a…
▽ More
We developed a method called batch and drift correction method by White Noise Normalization (WiNNbeta) to correct individual metabolites for batch effects and drifts. This method tests for white noise properties to identify metabolites in need of correction and corrects them by using fine-tuned splines. To test the method performance we applied WiNNbeta to LC-MS data from our metabolomic studies and computed CVs before and after WiNNbeta correction in quality control samples.
△ Less
Submitted 11 April, 2024;
originally announced April 2024.
-
Random Forests for Change Point Detection
Authors:
Malte Londschien,
Peter Bühlmann,
Solt Kovács
Abstract:
We propose a novel multivariate nonparametric multiple change point detection method using classifiers. We construct a classifier log-likelihood ratio that uses class probability predictions to compare different change point configurations. We propose a computationally feasible search method that is particularly well suited for random forests, denoted by changeforest. However, the method can be pa…
▽ More
We propose a novel multivariate nonparametric multiple change point detection method using classifiers. We construct a classifier log-likelihood ratio that uses class probability predictions to compare different change point configurations. We propose a computationally feasible search method that is particularly well suited for random forests, denoted by changeforest. However, the method can be paired with any classifier that yields class probability predictions, which we illustrate by also using a k-nearest neighbor classifier. We prove that it consistently locates change points in single change point settings when paired with a consistent classifier. Our proposed method changeforest achieves improved empirical performance in an extensive simulation study compared to existing multivariate nonparametric change point detection methods. An efficient implementation of our method is made available for R, Python, and Rust users in the changeforest software package.
△ Less
Submitted 15 August, 2023; v1 submitted 10 May, 2022;
originally announced May 2022.
-
Change point detection for graphical models in the presence of missing values
Authors:
Malte Londschien,
Solt Kovács,
Peter Bühlmann
Abstract:
We propose estimation methods for change points in high-dimensional covariance structures with an emphasis on challenging scenarios with missing values. We advocate three imputation like methods and investigate their implications on common losses used for change point detection. We also discuss how model selection methods have to be adapted to the setting of incomplete data. The methods are compar…
▽ More
We propose estimation methods for change points in high-dimensional covariance structures with an emphasis on challenging scenarios with missing values. We advocate three imputation like methods and investigate their implications on common losses used for change point detection. We also discuss how model selection methods have to be adapted to the setting of incomplete data. The methods are compared in a simulation study and applied to a time series from an environmental monitoring system. An implementation of our proposals within the R-package hdcd is available via the Supplementary materials.
△ Less
Submitted 22 October, 2020; v1 submitted 11 July, 2019;
originally announced July 2019.