
Showing 1–14 of 14 results for author: Sendera, M

Searching in archive cs.
  1. arXiv:2503.09746  [pdf, other]

    cs.LG cs.AI stat.ML

    Solving Bayesian inverse problems with diffusion priors and off-policy RL

    Authors: Luca Scimeca, Siddarth Venkatraman, Moksh Jain, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yashar Hezaveh, Laurence Perreault-Levasseur, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: This paper presents a practical application of Relative Trajectory Balance (RTB), a recently introduced off-policy reinforcement learning (RL) objective that can asymptotically solve Bayesian inverse problems optimally. We extend the original work by using RTB to train conditional diffusion model posteriors from pretrained unconditional priors for challenging linear and non-linear inverse problems…

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: Accepted as workshop paper at DeLTa workshop, ICLR 2025. arXiv admin note: substantial text overlap with arXiv:2405.20971

  2. arXiv:2502.07587  [pdf, other]

    cs.LG

    SEMU: Singular Value Decomposition for Efficient Machine Unlearning

    Authors: Marcin Sendera, Łukasz Struski, Kamil Książek, Kryspin Musiol, Jacek Tabor, Dawid Rymarczyk

    Abstract: While the capabilities of generative foundational models have advanced rapidly in recent years, methods to prevent harmful and unsafe behaviors remain underdeveloped. Among the pressing challenges in AI safety, machine unlearning (MU) has become increasingly critical to meet upcoming safety regulations. Most existing MU approaches focus on altering the most significant parameters of the model. How…

    Submitted 11 February, 2025; originally announced February 2025.

  3. arXiv:2502.06999  [pdf, other]

    cs.LG

    Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models

    Authors: Siddarth Venkatraman, Mohsin Hasan, Minsu Kim, Luca Scimeca, Marcin Sendera, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Any well-behaved generative model over a variable $\mathbf{x}$ can be expressed as a deterministic transformation of an exogenous ('outsourced') Gaussian noise variable $\mathbf{z}$: $\mathbf{x}=f_θ(\mathbf{z})$. In such a model (e.g., a VAE, GAN, or continuous-time flow-based model), sampling of the target variable $\mathbf{x} \sim p_θ(\mathbf{x})$ is straightforward, but sampling from a posterior…

    Submitted 28 May, 2025; v1 submitted 10 February, 2025; originally announced February 2025.

    Comments: ICML 2025; code: https://github.com/HyperPotatoNeo/Outsourced_Diffusion_Sampling

  4. arXiv:2501.06148  [pdf, other]

    cs.LG stat.ML

    From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

    Authors: Julius Berner, Lorenz Richter, Marcin Sendera, Jarrid Rector-Brooks, Nikolay Malkin

    Abstract: We study the problem of training neural stochastic differential equations, or diffusion models, to sample from a Boltzmann distribution without access to target samples. Existing methods for training such models enforce time-reversal of the generative and noising processes, using either differentiable simulation or off-policy reinforcement learning (RL). We prove equivalences between families of o…

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: code: https://github.com/GFNOrg/gfn-diffusion/tree/stagger

  5. arXiv:2410.15777  [pdf, ps, other]

    cs.LG stat.ML

    Revisiting the Equivalence of Bayesian Neural Networks and Gaussian Processes: On the Importance of Learning Activations

    Authors: Marcin Sendera, Amin Sorkhei, Tomasz Kuśmierczyk

    Abstract: Gaussian Processes (GPs) provide a convenient framework for specifying function-space priors, making them a natural choice for modeling uncertainty. In contrast, Bayesian Neural Networks (BNNs) offer greater scalability and extendability but lack the advantageous properties of GPs. This motivates the development of BNNs capable of replicating GP-like behavior. However, existing solutions are eithe…

    Submitted 11 June, 2025; v1 submitted 21 October, 2024; originally announced October 2024.

    Comments: Accepted to the 41st Conference on Uncertainty in Artificial Intelligence (UAI 2025). PMLR 244

  6. arXiv:2410.03941  [pdf, other]

    cs.CV

    AutoLoRA: AutoGuidance Meets Low-Rank Adaptation for Diffusion Models

    Authors: Artur Kasymov, Marcin Sendera, Michał Stypułkowski, Maciej Zięba, Przemysław Spurek

    Abstract: Low-rank adaptation (LoRA) is a fine-tuning technique that can be applied to conditional generative diffusion models. LoRA utilizes a small number of context examples to adapt the model to a specific domain, character, style, or concept. However, due to the limited data utilized during training, the fine-tuned model performance is often characterized by strong context bias and a low degree of vari…

    Submitted 4 October, 2024; originally announced October 2024.

  7. arXiv:2405.20971  [pdf, other]

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat…

    Submitted 13 January, 2025; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024; code: https://github.com/GFNOrg/diffusion-finetuning

  8. arXiv:2402.06121  [pdf, other]

    cs.LG stat.ML

    Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

    Authors: Tara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong

    Abstract: Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient -- and…

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024. Code for iDEM is available at https://github.com/jarridrb/dem

  9. arXiv:2402.05098  [pdf, other]

    cs.LG stat.ML

    Improved off-policy training of diffusion samplers

    Authors: Marcin Sendera, Minsu Kim, Sarthak Mittal, Pablo Lemos, Luca Scimeca, Jarrid Rector-Brooks, Alexandre Adam, Yoshua Bengio, Nikolay Malkin

    Abstract: We study the problem of training diffusion models to sample from a distribution with a given unnormalized density or energy function. We benchmark several diffusion-structured inference methods, including simulation-based variational approaches and off-policy methods (continuous generative flow networks). Our results shed light on the relative advantages of existing algorithms while bringing into…

    Submitted 13 January, 2025; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2024; code: https://github.com/GFNOrg/gfn-diffusion

  10. arXiv:2203.11378  [pdf, other]

    cs.LG cs.AI cs.CV

    HyperShot: Few-Shot Learning by Kernel HyperNetworks

    Authors: Marcin Sendera, Marcin Przewięźlikowski, Konrad Karanowski, Maciej Zięba, Jacek Tabor, Przemysław Spurek

    Abstract: Few-shot models aim at making predictions using a minimal number of labeled examples from a given task. The main challenge in this area is the one-shot setting where only one element represents each class. We propose HyperShot - the fusion of kernels and hypernetwork paradigm. Compared to reference approaches that apply a gradient-based adjustment of the parameters, our model aims to switch the cl…

    Submitted 21 March, 2022; originally announced March 2022.

  11. arXiv:2110.13561  [pdf, other]

    cs.LG

    Non-Gaussian Gaussian Processes for Few-Shot Regression

    Authors: Marcin Sendera, Jacek Tabor, Aleksandra Nowak, Andrzej Bedychaj, Massimiliano Patacchiola, Tomasz Trzciński, Przemysław Spurek, Maciej Zięba

    Abstract: Gaussian Processes (GPs) have been widely used in machine learning to model distributions over functions, with applications including multi-modal regression, time-series prediction, and few-shot learning. GPs are particularly useful in the last application since they rely on Normal distributions and enable closed-form computation of the posterior probability function. Unfortunately, because the re…

    Submitted 26 October, 2021; originally announced October 2021.

  12. arXiv:2108.04907  [pdf, other]

    cs.LG

    Flow-based SVDD for anomaly detection

    Authors: Marcin Sendera, Marek Śmieja, Łukasz Maziarka, Łukasz Struski, Przemysław Spurek, Jacek Tabor

    Abstract: We propose FlowSVDD -- a flow-based one-class classifier for anomaly/outliers detection that realizes a well-known SVDD principle using deep learning tools. Contrary to other approaches to deep SVDD, the proposed model is instantiated using flow-based models, which naturally prevents from collapsing of bounding hypersphere into a single point. Experiments show that FlowSVDD achieves comparable res…

    Submitted 10 August, 2021; originally announced August 2021.

    Comments: arXiv admin note: text overlap with arXiv:2010.03002

  13. OneFlow: One-class flow for anomaly detection based on a minimal volume region

    Authors: Łukasz Maziarka, Marek Śmieja, Marcin Sendera, Łukasz Struski, Jacek Tabor, Przemysław Spurek

    Abstract: We propose OneFlow - a flow-based one-class classifier for anomaly (outlier) detection that finds a minimal volume bounding region. Contrary to density-based methods, OneFlow is constructed in such a way that its result typically does not depend on the structure of outliers. This is caused by the fact that during training the gradient of the cost function is propagated only over the points located…

    Submitted 22 September, 2021; v1 submitted 6 October, 2020; originally announced October 2020.

    Journal ref: 2021, IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)

  14. arXiv:1904.04309  [pdf]

    cs.LG stat.ML

    Data adaptation in HANDY economy-ideology model

    Authors: Marcin Sendera

    Abstract: The concept of mathematical modeling is widespread across almost all of the fields of contemporary science and engineering. Because of the existing necessity of predictions the behavior of natural phenomena, the researchers develop more and more complex models. However, despite their ability to better forecasting, the problem of an appropriate fitting ground truth data to those, high-dimensional a…

    Submitted 8 April, 2019; originally announced April 2019.

    Comments: 172 pages