-
Minimum-Excess-Work Guidance
Authors:
Christopher Kolloff,
Tobias Höppe,
Emmanouil Angelis,
Mathias Jacob Schreiner,
Stefan Bauer,
Andrea Dittadi,
Simon Olsson
Abstract:
We propose a regularization framework inspired by thermodynamic work for guiding pre-trained probability flow generative models (e.g., continuous normalizing flows or diffusion models) by minimizing excess work, a concept rooted in statistical mechanics and with strong conceptual connections to optimal transport. Our approach enables efficient guidance in sparse-data regimes common to scientific applications, where only limited target samples or partial density constraints are available. We introduce two strategies: Path Guidance for sampling rare transition states by concentrating probability mass on user-defined subsets, and Observable Guidance for aligning generated distributions with experimental observables while preserving entropy. We demonstrate the framework's versatility on a coarse-grained protein model, guiding it to sample transition configurations between folded/unfolded states and correct systematic biases using experimental data. The method bridges thermodynamic principles with modern generative architectures, offering a principled, efficient, and physics-inspired alternative to standard fine-tuning in data-scarce domains. Empirical results highlight improved sample efficiency and bias reduction, underscoring its applicability to molecular simulations and beyond.
Submitted 23 May, 2025; v1 submitted 19 May, 2025;
originally announced May 2025.
-
In-silico biological discovery with large perturbation models
Authors:
Djordje Miladinovic,
Tobias Höppe,
Mathieu Chevalley,
Andreas Georgiou,
Lachlan Stuart,
Arash Mehrjou,
Marcus Bantscheff,
Bernhard Schölkopf,
Patrick Schwab
Abstract:
Data generated in perturbation experiments link perturbations to the changes they elicit and therefore contain information relevant to numerous biological discovery tasks -- from understanding the relationships between biological entities to developing therapeutics. However, these data encompass diverse perturbations and readouts, and the complex dependence of experimental outcomes on their biological context makes it challenging to integrate insights across experiments. Here, we present the Large Perturbation Model (LPM), a deep-learning model that integrates multiple, heterogeneous perturbation experiments by representing perturbation, readout, and context as disentangled dimensions. LPM outperforms existing methods across multiple biological discovery tasks, including in predicting post-perturbation transcriptomes of unseen experiments, identifying shared molecular mechanisms of action between chemical and genetic perturbations, and facilitating the inference of gene-gene interaction networks.
Submitted 30 March, 2025;
originally announced March 2025.
-
Diffusion Models for Video Prediction and Infilling
Authors:
Tobias Höppe,
Arash Mehrjou,
Stefan Bauer,
Didrik Nielsen,
Andrea Dittadi
Abstract:
Predicting and anticipating future outcomes or reasoning about missing information in a sequence are critical skills for agents to be able to make intelligent decisions. This requires strong, temporally coherent generative capabilities. Diffusion models have shown remarkable success in several generative tasks, but have not been extensively explored in the video domain. We present Random-Mask Video Diffusion (RaMViD), which extends image diffusion models to videos using 3D convolutions, and introduces a new conditioning technique during training. By varying the mask we condition on, the model is able to perform video prediction, infilling, and upsampling. Due to our simple conditioning scheme, we can utilize the same architecture as used for unconditional training, which allows us to train the model in a conditional and unconditional fashion at the same time. We evaluate RaMViD on two benchmark datasets for video prediction, on which we achieve state-of-the-art results, and one for video generation. High-resolution videos are provided at https://sites.google.com/view/video-diffusion-prediction.
Submitted 14 November, 2022; v1 submitted 15 June, 2022;
originally announced June 2022.
-
Everything counts! - Warum kleine Gemeinden die Gewinner der Zensuserhebung 2011 sind (Why small municipalities are the winners of the 2011 census)
Authors:
Björn Christensen,
Sören Christensen,
Tim Hoppe,
Michael Spandel
Abstract:
The 2011 population and housing census was conducted in all EU member states. In Germany, it was based on a largely register-based method. This paper shows that communities with fewer than 10,000 inhabitants have significantly smaller relative losses in the number of inhabitants than communities with more than 10,000 inhabitants.
Submitted 4 September, 2014;
originally announced September 2014.