Skip to main content

Showing 1–50 of 51 results for author: Malkin, N

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.01179  [pdf, other

    cs.RO

    Fast Flow-based Visuomotor Policies via Conditional Optimal Transport Couplings

    Authors: Andreas Sochopoulos, Nikolay Malkin, Nikolaos Tsagkas, João Moura, Michael Gienger, Sethu Vijayakumar

    Abstract: Diffusion and flow matching policies have recently demonstrated remarkable performance in robotic applications by accurately capturing multimodal robot trajectory distributions. However, their computationally expensive inference, due to the numerical integration of an ODE or SDE, limits their applicability as real-time controllers for robots. We introduce a methodology that utilizes conditional Op… ▽ More

    Submitted 2 May, 2025; originally announced May 2025.

  2. arXiv:2503.09746  [pdf, other

    cs.LG cs.AI stat.ML

    Solving Bayesian inverse problems with diffusion priors and off-policy RL

    Authors: Luca Scimeca, Siddarth Venkatraman, Moksh Jain, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yashar Hezaveh, Laurence Perreault-Levasseur, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: This paper presents a practical application of Relative Trajectory Balance (RTB), a recently introduced off-policy reinforcement learning (RL) objective that can asymptotically solve Bayesian inverse problems optimally. We extend the original work by using RTB to train conditional diffusion model posteriors from pretrained unconditional priors for challenging linear and non-linear inverse problems… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: Accepted as workshop paper at DeLTa workshop, ICLR 2025. arXiv admin note: substantial text overlap with arXiv:2405.20971

  3. arXiv:2503.06985  [pdf, other

    cs.LG

    Learning Decision Trees as Amortized Structure Inference

    Authors: Mohammed Mahfoud, Ghait Boukachab, Michał Koziarski, Alex Hernandez-Garcia, Stefan Bauer, Yoshua Bengio, Nikolay Malkin

    Abstract: Building predictive models for tabular data presents fundamental challenges, notably in scaling consistently, i.e., more resources translating to better performance, and generalizing systematically beyond the training data distribution. Designing decision tree models remains especially challenging given the intractably large search space, and most existing methods rely on greedy heuristics, while… ▽ More

    Submitted 10 March, 2025; originally announced March 2025.

    Comments: Code: $\href{https://github.com/GFNOrg/dt-gfn}{https://github.com/GFNOrg/dt-gfn}$

  4. arXiv:2502.11617  [pdf, other

    cs.LG cs.AI stat.ML

    In-Context Parametric Inference: Point or Distribution Estimators?

    Authors: Sarthak Mittal, Yoshua Bengio, Nikolay Malkin, Guillaume Lajoie

    Abstract: Bayesian and frequentist inference are two fundamental paradigms in statistical estimation. Bayesian methods treat hypotheses as random variables, incorporating priors and updating beliefs via Bayes' theorem, whereas frequentist methods assume fixed but unknown hypotheses, relying on estimators like maximum likelihood. While extensive research has compared these approaches, the frequentist paradig… ▽ More

    Submitted 17 February, 2025; originally announced February 2025.

  5. arXiv:2502.06999  [pdf, other

    cs.LG

    Outsourced diffusion sampling: Efficient posterior inference in latent spaces of generative models

    Authors: Siddarth Venkatraman, Mohsin Hasan, Minsu Kim, Luca Scimeca, Marcin Sendera, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Any well-behaved generative model over a variable $\mathbf{x}$ can be expressed as a deterministic transformation of an exogenous ('outsourced') Gaussian noise variable $\mathbf{z}$: $\mathbf{x}=f_θ(\mathbf{z})$. In such a model (e.g., a VAE, GAN, or continuous-time flow-based model), sampling of the target variable $\mathbf{x} \sim p_θ(\mathbf{x})$ is straightforward, but sampling from a posterio… ▽ More

    Submitted 10 February, 2025; originally announced February 2025.

  6. arXiv:2501.06148  [pdf, other

    cs.LG stat.ML

    From discrete-time policies to continuous-time diffusion samplers: Asymptotic equivalences and faster training

    Authors: Julius Berner, Lorenz Richter, Marcin Sendera, Jarrid Rector-Brooks, Nikolay Malkin

    Abstract: We study the problem of training neural stochastic differential equations, or diffusion models, to sample from a Boltzmann distribution without access to target samples. Existing methods for training such models enforce time-reversal of the generative and noising processes, using either differentiable simulation or off-policy reinforcement learning (RL). We prove equivalences between families of o… ▽ More

    Submitted 10 January, 2025; originally announced January 2025.

    Comments: code: https://github.com/GFNOrg/gfn-diffusion/tree/stagger

  7. arXiv:2411.02830  [pdf, other

    cs.CL cs.AI cs.LG

    Mixtures of In-Context Learners

    Authors: Giwon Hong, Emile van Krieken, Edoardo Ponti, Nikolay Malkin, Pasquale Minervini

    Abstract: In-context learning (ICL) adapts LLMs by providing demonstrations without fine-tuning the model parameters; however, it does not differentiate between demonstrations and quadratically increases the complexity of Transformer LLMs, exhausting the memory. As a solution, we propose Mixtures of In-Context Learners (MoICL), a novel approach to treat subsets of demonstrations as experts and learn a weigh… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

  8. arXiv:2410.15184  [pdf, other

    cs.LG cs.AI

    Action abstractions for amortized sampling

    Authors: Oussama Boussif, Léna Néhale Ezzine, Joseph D Viviano, Michał Koziarski, Moksh Jain, Nikolay Malkin, Emmanuel Bengio, Rim Assouel, Yoshua Bengio

    Abstract: As trajectories sampled by policies used by reinforcement learning (RL) and generative flow networks (GFlowNets) grow longer, credit assignment and exploration become more challenging, and the long planning horizon hinders mode discovery and generalization. The challenge is particularly pronounced in entropy-seeking RL methods, such as generative flow networks, where the agent must learn to sample… ▽ More

    Submitted 19 October, 2024; originally announced October 2024.

  9. arXiv:2410.13224  [pdf, other

    cs.CL

    Proof Flow: Preliminary Study on Generative Flow Network Language Model Tuning for Formal Reasoning

    Authors: Matthew Ho, Vincent Zhu, Xiaoyin Chen, Moksh Jain, Nikolay Malkin, Edwin Zhang

    Abstract: Reasoning is a fundamental substrate for solving novel and complex problems. Deliberate efforts in learning and developing frameworks around System 2 reasoning have made great strides, yet problems of sufficient complexity remain largely out of reach for open models. To address this gap, we examine the potential of Generative Flow Networks as a fine-tuning method for LLMs to unlock advanced reason… ▽ More

    Submitted 17 October, 2024; originally announced October 2024.

  10. arXiv:2410.01432  [pdf, other

    cs.LG stat.ML

    Adaptive teachers for amortized samplers

    Authors: Minsu Kim, Sanghyeok Choi, Taeyoung Yun, Emmanuel Bengio, Leo Feng, Jarrid Rector-Brooks, Sungsoo Ahn, Jinkyoo Park, Nikolay Malkin, Yoshua Bengio

    Abstract: Amortized inference is the task of training a parametric model, such as a neural network, to approximate a distribution with a given unnormalized density where exact sampling is intractable. When sampling is implemented as a sequential decision-making process, reinforcement learning (RL) methods, such as generative flow networks, can be used to train the sampling policy. Off-policy RL training fac… ▽ More

    Submitted 14 April, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: ICLR 2025, 27 pages, 12 figures

  11. arXiv:2408.05284  [pdf, other

    cs.AI cs.LG

    Can a Bayesian Oracle Prevent Harm from an Agent?

    Authors: Yoshua Bengio, Michael K. Cohen, Nikolay Malkin, Matt MacDermott, Damiano Fornasiere, Pietro Greiner, Younesse Kaddar

    Abstract: Is there a way to design powerful AI systems based on machine learning methods that would satisfy probabilistic safety guarantees? With the long-term goal of obtaining a probabilistic guarantee that would apply in every context, we consider estimating a context-dependent bound on the probability of violating a given safety specification. Such a risk evaluation would need to be performed at run-tim… ▽ More

    Submitted 22 August, 2024; v1 submitted 9 August, 2024; originally announced August 2024.

  12. arXiv:2407.03105  [pdf, other

    cs.LG

    On Generalization for Generative Flow Networks

    Authors: Anas Krichel, Nikolay Malkin, Salem Lahlou, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets) have emerged as an innovative learning paradigm designed to address the challenge of sampling from an unnormalized probability distribution, called the reward function. This framework learns a policy on a constructed graph, which enables sampling from an approximation of the target probability distribution through successive steps of sampling from the learned p… ▽ More

    Submitted 3 July, 2024; originally announced July 2024.

  13. arXiv:2405.20971  [pdf, other

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat… ▽ More

    Submitted 13 January, 2025; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024; code: https://github.com/GFNOrg/diffusion-finetuning

  14. arXiv:2405.18540  [pdf, other

    cs.CL cs.CR cs.LG

    Learning diverse attacks on large language models for robust red-teaming and safety tuning

    Authors: Seanie Lee, Minsu Kim, Lynn Cherif, David Dobre, Juho Lee, Sung Ju Hwang, Kenji Kawaguchi, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Moksh Jain

    Abstract: Red-teaming, or identifying prompts that elicit harmful responses, is a critical step in ensuring the safe and responsible deployment of large language models (LLMs). Developing effective protection against many modes of attack prompts requires discovering diverse attacks. Automated red-teaming typically uses reinforcement learning to fine-tune an attacker language model to generate prompts that e… ▽ More

    Submitted 28 February, 2025; v1 submitted 28 May, 2024; originally announced May 2024.

    Comments: ICLR 2025

  15. arXiv:2403.04571  [pdf, ps, other

    cs.AI

    Machine learning and information theory concepts towards an AI Mathematician

    Authors: Yoshua Bengio, Nikolay Malkin

    Abstract: The current state-of-the-art in artificial intelligence is impressive, especially in terms of mastery of language, but not so much in terms of mathematical reasoning. What could be missing? Can we learn something useful about that gap from how the brains of mathematicians go about their craft? This essay builds on the idea that current deep learning mostly succeeds at system 1 abilities -- which c… ▽ More

    Submitted 7 March, 2024; originally announced March 2024.

    Comments: To appear in the Bulletin of the AMS, 2024

  16. arXiv:2402.10309  [pdf, other

    cs.LG

    Discrete Probabilistic Inference as Control in Multi-path Environments

    Authors: Tristan Deleu, Padideh Nouri, Nikolay Malkin, Doina Precup, Yoshua Bengio

    Abstract: We consider the problem of sampling from a discrete and structured distribution as a sequential decision problem, where the objective is to find a stochastic policy such that objects are sampled at the end of this sequential process proportionally to some predefined reward. While we could use maximum entropy Reinforcement Learning (MaxEnt RL) to solve this problem for some distributions, it has be… ▽ More

    Submitted 27 May, 2024; v1 submitted 15 February, 2024; originally announced February 2024.

  17. arXiv:2402.06457  [pdf, other

    cs.LG cs.AI cs.CL

    V-STaR: Training Verifiers for Self-Taught Reasoners

    Authors: Arian Hosseini, Xingdi Yuan, Nikolay Malkin, Aaron Courville, Alessandro Sordoni, Rishabh Agarwal

    Abstract: Common self-improvement approaches for large language models (LLMs), such as STaR, iteratively fine-tune LLMs on self-generated solutions to improve their problem-solving ability. However, these approaches discard the large amounts of incorrect solutions generated during this process, potentially neglecting valuable information in such solutions. To address this shortcoming, we propose V-STaR that… ▽ More

    Submitted 13 August, 2024; v1 submitted 9 February, 2024; originally announced February 2024.

  18. arXiv:2402.06121  [pdf, other

    cs.LG stat.ML

    Iterated Denoising Energy Matching for Sampling from Boltzmann Densities

    Authors: Tara Akhound-Sadegh, Jarrid Rector-Brooks, Avishek Joey Bose, Sarthak Mittal, Pablo Lemos, Cheng-Hao Liu, Marcin Sendera, Siamak Ravanbakhsh, Gauthier Gidel, Yoshua Bengio, Nikolay Malkin, Alexander Tong

    Abstract: Efficiently generating statistically independent samples from an unnormalized probability distribution, such as equilibrium samples of many-body systems, is a foundational problem in science. In this paper, we propose Iterated Denoising Energy Matching (iDEM), an iterative algorithm that uses a novel stochastic score matching objective leveraging solely the energy function and its gradient -- and… ▽ More

    Submitted 26 June, 2024; v1 submitted 8 February, 2024; originally announced February 2024.

    Comments: Published at ICML 2024. Code for iDEM is available at https://github.com/jarridrb/dem

  19. arXiv:2402.05098  [pdf, other

    cs.LG stat.ML

    Improved off-policy training of diffusion samplers

    Authors: Marcin Sendera, Minsu Kim, Sarthak Mittal, Pablo Lemos, Luca Scimeca, Jarrid Rector-Brooks, Alexandre Adam, Yoshua Bengio, Nikolay Malkin

    Abstract: We study the problem of training diffusion models to sample from a distribution with a given unnormalized density or energy function. We benchmark several diffusion-structured inference methods, including simulation-based variational approaches and off-policy methods (continuous generative flow networks). Our results shed light on the relative advantages of existing algorithms while bringing into… ▽ More

    Submitted 13 January, 2025; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: NeurIPS 2024; code: https://github.com/GFNOrg/gfn-diffusion

  20. arXiv:2402.04355  [pdf, other

    stat.ML cs.AI cs.LG stat.ME

    PQMass: Probabilistic Assessment of the Quality of Generative Models using Probability Mass Estimation

    Authors: Pablo Lemos, Sammy Sharief, Nikolay Malkin, Salma Salhi, Connor Stone, Laurence Perreault-Levasseur, Yashar Hezaveh

    Abstract: We propose a likelihood-free method for comparing two distributions given samples from each, with the goal of assessing the quality of generative models. The proposed approach, PQMass, provides a statistically rigorous method for assessing the performance of a single generative model or the comparison of multiple competing models. PQMass divides the sample space into non-overlapping regions and ap… ▽ More

    Submitted 6 March, 2025; v1 submitted 6 February, 2024; originally announced February 2024.

    Comments: Published as a conference paper at ICLR 2025

  21. arXiv:2312.03911  [pdf, other

    cs.LG stat.CO stat.ME stat.ML

    Improving Gradient-guided Nested Sampling for Posterior Inference

    Authors: Pablo Lemos, Nikolay Malkin, Will Handley, Yoshua Bengio, Yashar Hezaveh, Laurence Perreault-Levasseur

    Abstract: We present a performant, general-purpose gradient-guided nested sampling algorithm, ${\tt GGNS}$, combining the state of the art in differentiable programming, Hamiltonian slice sampling, clustering, mode separation, dynamic nested sampling, and parallelization. This unique combination allows ${\tt GGNS}$ to scale well with dimensionality and perform competitively on a variety of synthetic and rea… ▽ More

    Submitted 6 December, 2023; originally announced December 2023.

    Comments: 10 pages, 5 figures. Code available at https://github.com/Pablo-Lemos/GGNS

  22. arXiv:2310.08774  [pdf, other

    q-bio.PE cs.LG stat.ML

    PhyloGFN: Phylogenetic inference with generative flow networks

    Authors: Mingyang Zhou, Zichao Yan, Elliot Layne, Nikolay Malkin, Dinghuai Zhang, Moksh Jain, Mathieu Blanchette, Yoshua Bengio

    Abstract: Phylogenetics is a branch of computational biology that studies the evolutionary relationships among biological entities. Its long history and numerous applications notwithstanding, inference of phylogenetic trees from sequence data remains challenging: the high complexity of tree space poses a significant obstacle for the current combinatorial and probabilistic techniques. In this paper, we adopt… ▽ More

    Submitted 24 March, 2024; v1 submitted 12 October, 2023; originally announced October 2023.

  23. arXiv:2310.04363  [pdf, other

    cs.LG cs.CL

    Amortizing intractable inference in large language models

    Authors: Edward J. Hu, Moksh Jain, Eric Elmoznino, Younesse Kaddar, Guillaume Lajoie, Yoshua Bengio, Nikolay Malkin

    Abstract: Autoregressive large language models (LLMs) compress knowledge from their training data through next-token conditional distributions. This limits tractable querying of this knowledge to start-to-end autoregressive sampling. However, many tasks of interest -- including sequence continuation, infilling, and other forms of constrained generation -- involve sampling from intractable posterior distribu… ▽ More

    Submitted 13 March, 2024; v1 submitted 6 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; 23 pages; code: https://github.com/GFNOrg/gfn-lm-tuning

  24. arXiv:2310.02779  [pdf, other

    cs.LG cs.GT

    Expected flow networks in stochastic environments and two-player zero-sum games

    Authors: Marco Jiralerspong, Bilun Sun, Danilo Vucetic, Tianyu Zhang, Yoshua Bengio, Gauthier Gidel, Nikolay Malkin

    Abstract: Generative flow networks (GFlowNets) are sequential sampling models trained to match a given distribution. GFlowNets have been successfully applied to various structured object generation tasks, sampling a diverse set of high-reward objects quickly. We propose expected flow networks (EFlowNets), which extend GFlowNets to stochastic environments. We show that EFlowNets outperform other GFlowNet for… ▽ More

    Submitted 13 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; code: https://github.com/GFNOrg/AdversarialFlowNetworks

  25. arXiv:2310.02423  [pdf, other

    cs.LG stat.ML

    Delta-AI: Local objectives for amortized inference in sparse graphical models

    Authors: Jean-Pierre Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, Thomas Jiralerspong, Dinghuai Zhang, Guillaume Lajoie, Yoshua Bengio

    Abstract: We present a new algorithm for amortized inference in sparse probabilistic graphical models (PGMs), which we call $Δ$-amortized inference ($Δ$-AI). Our approach is based on the observation that when the sampling of variables in a PGM is seen as a sequence of actions taken by an agent, sparsity of the PGM enables local credit assignment in the agent's policy learning objective. This yields a local… ▽ More

    Submitted 13 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; 19 pages, code: https://github.com/GFNOrg/Delta-AI/

  26. arXiv:2310.01807  [pdf, other

    cs.AI cs.LG

    Discrete, compositional, and symbolic representations through attractor dynamics

    Authors: Andrew Nam, Eric Elmoznino, Nikolay Malkin, James McClelland, Yoshua Bengio, Guillaume Lajoie

    Abstract: Symbolic systems are powerful frameworks for modeling cognitive processes as they encapsulate the rules and relationships fundamental to many aspects of human reasoning and behavior. Central to these models are systematicity, compositionality, and productivity, making them invaluable in both cognitive science and artificial intelligence. However, certain limitations remain. For instance, the integ… ▽ More

    Submitted 26 September, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

  27. arXiv:2307.03672  [pdf, other

    cs.LG

    Simulation-free Schrödinger bridges via score and flow matching

    Authors: Alexander Tong, Nikolay Malkin, Kilian Fatras, Lazar Atanackovic, Yanlei Zhang, Guillaume Huguet, Guy Wolf, Yoshua Bengio

    Abstract: We present simulation-free score and flow matching ([SF]$^2$M), a simulation-free objective for inferring stochastic dynamics given unpaired samples drawn from arbitrary source and target distributions. Our method generalizes both the score-matching loss used in the training of diffusion models and the recently proposed flow matching loss used in the training of continuous normalizing flows. [SF]… ▽ More

    Submitted 11 March, 2024; v1 submitted 7 July, 2023; originally announced July 2023.

    Comments: AISTATS 2024. Code: https://github.com/atong01/conditional-flow-matching

  28. arXiv:2306.17693  [pdf, other

    cs.LG

    Thompson sampling for improved exploration in GFlowNets

    Authors: Jarrid Rector-Brooks, Kanika Madan, Moksh Jain, Maksym Korablyov, Cheng-Hao Liu, Sarath Chandar, Nikolay Malkin, Yoshua Bengio

    Abstract: Generative flow networks (GFlowNets) are amortized variational inference algorithms that treat sampling from a distribution over compositional objects as a sequential decision-making problem with a learnable action policy. Unlike other algorithms for hierarchical sampling that optimize a variational bound, GFlowNet algorithms can stably run off-policy, which can be advantageous for discovering mod… ▽ More

    Submitted 30 June, 2023; originally announced June 2023.

    Comments: Structured Probabilistic Inference and Generative Modeling (SPIGM) workshop @ ICML 2023

  29. arXiv:2306.15058  [pdf, other

    cs.LG stat.ML

    BatchGFN: Generative Flow Networks for Batch Active Learning

    Authors: Shreshth A. Malik, Salem Lahlou, Andrew Jesson, Moksh Jain, Nikolay Malkin, Tristan Deleu, Yoshua Bengio, Yarin Gal

    Abstract: We introduce BatchGFN -- a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points proportional to a batch reward. With an appropriate reward function to quantify the utility of acquiring a batch, such as the joint mutual information between the batch and the model parameters, BatchGFN is able to construct highly informative batches for active… ▽ More

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at the Structured Probabilistic Inference & Generative Modeling workshop, ICML 2023

  30. arXiv:2305.19366  [pdf, other

    cs.LG stat.ML

    Joint Bayesian Inference of Graphical Structure and Parameters with a Single Generative Flow Network

    Authors: Tristan Deleu, Mizu Nishikawa-Toomey, Jithendaraa Subramanian, Nikolay Malkin, Laurent Charlin, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets), a class of generative models over discrete and structured sample spaces, have been previously applied to the problem of inferring the marginal posterior distribution over the directed acyclic graph (DAG) of a Bayesian Network, given a dataset of observations. Based on recent advances extending this framework to non-discrete sample spaces, we propose in this pa… ▽ More

    Submitted 30 October, 2023; v1 submitted 30 May, 2023; originally announced May 2023.

  31. arXiv:2305.17010  [pdf, other

    cs.LG cs.AI cs.DM stat.ML

    Let the Flows Tell: Solving Graph Combinatorial Optimization Problems with GFlowNets

    Authors: Dinghuai Zhang, Hanjun Dai, Nikolay Malkin, Aaron Courville, Yoshua Bengio, Ling Pan

    Abstract: Combinatorial optimization (CO) problems are often NP-hard and thus out of reach for exact algorithms, making them a tempting domain to apply machine learning methods. The highly structured constraints in these problems can hinder either optimization or sampling directly in the solution space. On the other hand, GFlowNets have recently emerged as a powerful machinery to efficiently sample from com… ▽ More

    Submitted 20 November, 2023; v1 submitted 26 May, 2023; originally announced May 2023.

    Comments: Accepted by NeurIPS 2023 as spotlight

  32. arXiv:2302.06576  [pdf, other

    cs.LG stat.ML

    GFlowNet-EM for learning compositional latent variable models

    Authors: Edward J. Hu, Nikolay Malkin, Moksh Jain, Katie Everett, Alexandros Graikos, Yoshua Bengio

    Abstract: Latent variable models (LVMs) with discrete compositional latents are an important but challenging setting due to a combinatorially large number of possible configurations of the latents. A key tradeoff in modeling the posteriors over latents is between expressivity and tractable optimization. For algorithms based on expectation-maximization (EM), the E-step is often intractable without restrictiv… ▽ More

    Submitted 3 June, 2023; v1 submitted 13 February, 2023; originally announced February 2023.

    Comments: ICML 2023; code: https://github.com/GFNOrg/GFlowNet-EM

  33. arXiv:2302.01687  [pdf, other

    cs.LG

    Better Training of GFlowNets with Local Credit and Incomplete Trajectories

    Authors: Ling Pan, Nikolay Malkin, Dinghuai Zhang, Yoshua Bengio

    Abstract: Generative Flow Networks or GFlowNets are related to Monte-Carlo Markov chain methods (as they sample from a distribution specified by an energy function), reinforcement learning (as they learn a policy to sample composed objects through a sequence of steps), generative models (as they learn to represent and sample from a distribution) and amortized variational methods (as they can be used to lear… ▽ More

    Submitted 18 June, 2023; v1 submitted 3 February, 2023; originally announced February 2023.

    Comments: ICML 2023

  34. arXiv:2302.00482  [pdf, other

    cs.LG

    Improving and generalizing flow-based generative models with minibatch optimal transport

    Authors: Alexander Tong, Kilian Fatras, Nikolay Malkin, Guillaume Huguet, Yanlei Zhang, Jarrid Rector-Brooks, Guy Wolf, Yoshua Bengio

    Abstract: Continuous normalizing flows (CNFs) are an attractive generative modeling technique, but they have been held back by limitations in their simulation-based maximum likelihood training. We introduce the generalized conditional flow matching (CFM) technique, a family of simulation-free training objectives for CNFs. CFM features a stable regression objective like that used to train the stochastic flow… ▽ More

    Submitted 11 March, 2024; v1 submitted 1 February, 2023; originally announced February 2023.

    Comments: TMLR. Code: https://github.com/atong01/conditional-flow-matching

  35. arXiv:2301.12594  [pdf, other

    cs.LG stat.ML

    A theory of continuous generative flow networks

    Authors: Salem Lahlou, Tristan Deleu, Pablo Lemos, Dinghuai Zhang, Alexandra Volokhova, Alex Hernández-García, Léna Néhale Ezzine, Yoshua Bengio, Nikolay Malkin

    Abstract: Generative flow networks (GFlowNets) are amortized variational inference algorithms that are trained to sample from unnormalized target distributions over compositional objects. A key limitation of GFlowNets until this time has been that they are restricted to discrete spaces. We present a theory for generalized GFlowNets, which encompasses both existing discrete GFlowNets and ones with continuous… ▽ More

    Submitted 25 May, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: ICML 2023; 32 pages; code: https://github.com/saleml/continuous-gfn

  36. arXiv:2211.03812  [pdf, other

    astro-ph.IM cs.CV cs.LG

    Posterior samples of source galaxies in strong gravitational lenses with score-based priors

    Authors: Alexandre Adam, Adam Coogan, Nikolay Malkin, Ronan Legin, Laurence Perreault-Levasseur, Yashar Hezaveh, Yoshua Bengio

    Abstract: Inferring accurate posteriors for high-dimensional representations of the brightness of gravitationally-lensed sources is a major challenge, in part due to the difficulties of accurately quantifying the priors. Here, we report the use of a score-based model to encode the prior for the inference of undistorted images of background galaxies. This model is trained on a set of high-resolution images o… ▽ More

    Submitted 29 November, 2022; v1 submitted 7 November, 2022; originally announced November 2022.

    Comments: 5+6 pages, 3 figures, Accepted (poster + contributed talk) for the Machine Learning and the Physical Sciences Workshop at the 36th conference on Neural Information Processing Systems (NeurIPS 2022); Corrected style file and added authors checklist

  37. arXiv:2210.12928  [pdf, other

    cs.LG cs.AI

    GFlowOut: Dropout with Generative Flow Networks

    Authors: Dianbo Liu, Moksh Jain, Bonaventure Dossou, Qianli Shen, Salem Lahlou, Anirudh Goyal, Nikolay Malkin, Chris Emezue, Dinghuai Zhang, Nadhir Hassen, Xu Ji, Kenji Kawaguchi, Yoshua Bengio

    Abstract: Bayesian Inference offers principled tools to tackle many critical problems with modern neural networks such as poor calibration and generalization, and data inefficiency. However, scaling Bayesian inference to large architectures is challenging and requires restrictive approximations. Monte Carlo Dropout has been widely used as a relatively cheap way for approximate Inference and to estimate unce… ▽ More

    Submitted 23 June, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

  38. arXiv:2210.01293  [pdf, other

    cs.CL

    ThinkSum: Probabilistic reasoning over sets using large language models

    Authors: Batu Ozturkler, Nikolay Malkin, Zhen Wang, Nebojsa Jojic

    Abstract: Large language models (LLMs) have a substantial capacity for high-level analogical reasoning: reproducing patterns in linear text that occur in their training data (zero-shot evaluation) or in the provided context (few-shot in-context learning). However, recent studies show that even the more advanced LLMs fail in scenarios that require reasoning over multiple objects or facts and making sequences… ▽ More

    Submitted 2 June, 2023; v1 submitted 3 October, 2022; originally announced October 2022.

    Comments: ACL 2023

  39. arXiv:2210.00580  [pdf, other

    cs.LG stat.ML

    GFlowNets and variational inference

    Authors: Nikolay Malkin, Salem Lahlou, Tristan Deleu, Xu Ji, Edward Hu, Katie Everett, Dinghuai Zhang, Yoshua Bengio

    Abstract: This paper builds bridges between two families of probabilistic algorithms: (hierarchical) variational inference (VI), which is typically used to model distributions over continuous spaces, and generative flow networks (GFlowNets), which have been used for distributions over discrete structures such as graphs. We demonstrate that, in certain cases, VI algorithms are equivalent to special cases of… ▽ More

    Submitted 2 March, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 final version; code: https://github.com/GFNOrg/GFN_vs_HVI

  40. arXiv:2209.12782  [pdf, other

    cs.LG stat.ML

    Learning GFlowNets from partial episodes for improved convergence and stability

    Authors: Kanika Madan, Jarrid Rector-Brooks, Maksym Korablyov, Emmanuel Bengio, Moksh Jain, Andrei Nica, Tom Bosc, Yoshua Bengio, Nikolay Malkin

    Abstract: Generative flow networks (GFlowNets) are a family of algorithms for training a sequential sampler of discrete objects under an unnormalized target density and have been successfully used for various probabilistic modeling tasks. Existing training objectives for GFlowNets are either local to states or transitions, or propagate a reward signal over an entire sampling trajectory. We argue that these… ▽ More

    Submitted 3 June, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: ICML 2023

  41. arXiv:2209.02606  [pdf, other

    cs.LG cs.AI stat.ML

    Unifying Generative Models with GFlowNets and Beyond

    Authors: Dinghuai Zhang, Ricky T. Q. Chen, Nikolay Malkin, Yoshua Bengio

    Abstract: There are many frameworks for deep generative modeling, each often presented with their own specific training algorithms and inference methods. Here, we demonstrate the connections between existing deep generative models and the recently introduced GFlowNet framework, a probabilistic inference machine which treats sampling as a decision-making process. This analysis sheds light on their overlappin… ▽ More

    Submitted 30 January, 2023; v1 submitted 6 September, 2022; originally announced September 2022.

    Comments: expanded version of the ICML 2022 workshop paper

  42. arXiv:2206.09012  [pdf, other

    cs.LG cs.CV

    Diffusion models as plug-and-play priors

    Authors: Alexandros Graikos, Nikolay Malkin, Nebojsa Jojic, Dimitris Samaras

    Abstract: We consider the problem of inferring high-dimensional data $\mathbf{x}$ in a model that consists of a prior $p(\mathbf{x})$ and an auxiliary differentiable constraint $c(\mathbf{x},\mathbf{y})$ on $x$ given some additional information $\mathbf{y}$. In this paper, the prior is an independently trained denoising diffusion generative model. The auxiliary constraint is expected to have a differentiabl… ▽ More

    Submitted 8 January, 2023; v1 submitted 17 June, 2022; originally announced June 2022.

    Comments: NeurIPS 2022; code: https://github.com/AlexGraikos/diffusion_priors

  43. arXiv:2202.14000  [pdf, other

    cs.LG stat.ML

    Resolving label uncertainty with implicit posterior models

    Authors: Esther Rolf, Nikolay Malkin, Alexandros Graikos, Ana Jojic, Caleb Robinson, Nebojsa Jojic

    Abstract: We propose a method for jointly inferring labels across a collection of data samples, where each sample consists of an observation and a prior belief about the label. By implicitly assuming the existence of a generative model for which a differentiable predictor is the posterior, we derive a training objective that allows learning under weak beliefs. This formulation unifies various machine learni… ▽ More

    Submitted 17 June, 2022; v1 submitted 28 February, 2022; originally announced February 2022.

    Comments: UAI 2022; code: https://github.com/estherrolf/implicit-posterior

  44. arXiv:2202.01361  [pdf, other

    cs.LG stat.ML

    Generative Flow Networks for Discrete Probabilistic Modeling

    Authors: Dinghuai Zhang, Nikolay Malkin, Zhen Liu, Alexandra Volokhova, Aaron Courville, Yoshua Bengio

    Abstract: We present energy-based generative flow networks (EB-GFN), a novel probabilistic modeling algorithm for high-dimensional discrete data. Building upon the theory of generative flow networks (GFlowNets), we model the generation process by a stochastic data construction policy and thus amortize expensive MCMC exploration into a fixed number of actions sampled from a GFlowNet. We show how GFlowNets ca… ▽ More

    Submitted 8 June, 2022; v1 submitted 2 February, 2022; originally announced February 2022.

    Comments: Accepted by ICML 2022

  45. arXiv:2201.13259  [pdf, other

    cs.LG stat.ML

    Trajectory balance: Improved credit assignment in GFlowNets

    Authors: Nikolay Malkin, Moksh Jain, Emmanuel Bengio, Chen Sun, Yoshua Bengio

    Abstract: Generative flow networks (GFlowNets) are a method for learning a stochastic policy for generating compositional objects, such as graphs or strings, from a given unnormalized density by sequences of actions, where many possible action sequences may lead to the same object. We find previously proposed learning objectives for GFlowNets, flow matching and detailed balance, which are analogous to tempo… ▽ More

    Submitted 4 October, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: NeurIPS 2022; see footnotes for code; v3 fixes minor errata

  46. arXiv:2112.07047  [pdf, ps, other

    cs.CY

    SoK: A Framework for Unifying At-Risk User Research

    Authors: Noel Warford, Tara Matthews, Kaitlyn Yang, Omer Akgul, Sunny Consolvo, Patrick Gage Kelley, Nathan Malkin, Michelle L. Mazurek, Manya Sleeper, Kurt Thomas

    Abstract: At-risk users are people who experience elevated digital security, privacy, and safety threats because of what they do, who they are, where they are, or who they are with. In this systematization work, we present a framework for reasoning about at-risk users based on a wide-ranging meta-analysis of 85 papers. Across the varied populations that we examined (e.g., children, activists, women in devel… ▽ More

    Submitted 13 December, 2021; originally announced December 2021.

    Comments: 18 pages, 2 tables

  47. arXiv:2110.08294  [pdf, other

    cs.CL

    Coherence boosting: When your pretrained language model is not paying enough attention

    Authors: Nikolay Malkin, Zhen Wang, Nebojsa Jojic

    Abstract: Long-range semantic coherence remains a challenge in automatic language generation and understanding. We demonstrate that large language models have insufficiently learned the effect of distant words on next-token prediction. We present coherence boosting, an inference procedure that increases a LM's focus on a long context. We show the benefits of coherence boosting with pretrained models by dist… ▽ More

    Submitted 16 March, 2022; v1 submitted 15 October, 2021; originally announced October 2021.

    Comments: ACL 2022; code: https://github.com/zhenwang9102/coherence-boosting

  48. arXiv:2109.04867  [pdf, other

    cs.CL

    Studying word order through iterative shuffling

    Authors: Nikolay Malkin, Sameera Lanka, Pranav Goel, Nebojsa Jojic

    Abstract: As neural language models approach human performance on NLP benchmark tasks, their advances are widely seen as evidence of an increasingly complex understanding of syntax. This view rests upon a hypothesis that has not yet been empirically tested: that word order encodes meaning essential to performing these tasks. We refute this hypothesis in many cases: in the GLUE suite and in various genres of… ▽ More

    Submitted 10 September, 2021; originally announced September 2021.

    Comments: EMNLP 2021

  49. arXiv:2101.01154  [pdf, other

    cs.CV cs.AI cs.LG

    High-resolution land cover change from low-resolution labels: Simple baselines for the 2021 IEEE GRSS Data Fusion Contest

    Authors: Nikolay Malkin, Caleb Robinson, Nebojsa Jojic

    Abstract: We present simple algorithms for land cover change detection in the 2021 IEEE GRSS Data Fusion Contest. The task of the contest is to create high-resolution (1m / pixel) land cover change maps of a study area in Maryland, USA, given multi-resolution imagery and label data. We study several baseline models for this task and discuss directions for further research. See https://dfc2021.blob.core.wi… ▽ More

    Submitted 4 January, 2021; originally announced January 2021.

    Comments: 10 pages

  50. arXiv:2004.11498  [pdf, other

    cs.CV

    Mining self-similarity: Label super-resolution with epitomic representations

    Authors: Nikolay Malkin, Anthony Ortiz, Caleb Robinson, Nebojsa Jojic

    Abstract: We show that simple patch-based models, such as epitomes, can have superior performance to the current state of the art in semantic segmentation and label super-resolution, which uses deep convolutional neural networks. We derive a new training algorithm for epitomes which allows, for the first time, learning from very large data sets and derive a label super-resolution algorithm as a statistical… ▽ More

    Submitted 13 December, 2021; v1 submitted 23 April, 2020; originally announced April 2020.

    Comments: ECCV 2020 final version