Showing 1–36 of 36 results for author: Bengio, E

Searching in archive cs.
  1. arXiv:2505.14613

    cs.LG q-bio.QM

    Virtual Cells: Predict, Explain, Discover

    Authors: Emmanuel Noutahi, Jason Hartford, Prudencio Tossou, Shawn Whitfield, Alisandra K. Denton, Cas Wognum, Kristina Ulicna, Michael Craig, Jonathan Hsu, Michael Cuccarese, Emmanuel Bengio, Dominique Beaini, Christopher Gibson, Daniel Cohen, Berton Earnshaw

    Abstract: Drug discovery is fundamentally a process of inferring the effects of treatments on patients, and would therefore benefit immensely from computational models that can reliably simulate patient responses, enabling researchers to generate and test large numbers of therapeutic hypotheses safely and economically before initiating costly clinical trials. Even a more specific model that predicts the fun…

    Submitted 4 June, 2025; v1 submitted 20 May, 2025; originally announced May 2025.

  2. arXiv:2503.09746

    cs.LG cs.AI stat.ML

    Solving Bayesian inverse problems with diffusion priors and off-policy RL

    Authors: Luca Scimeca, Siddarth Venkatraman, Moksh Jain, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yashar Hezaveh, Laurence Perreault-Levasseur, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: This paper presents a practical application of Relative Trajectory Balance (RTB), a recently introduced off-policy reinforcement learning (RL) objective that can asymptotically solve Bayesian inverse problems optimally. We extend the original work by using RTB to train conditional diffusion model posteriors from pretrained unconditional priors for challenging linear and non-linear inverse problems…

    Submitted 12 March, 2025; originally announced March 2025.

    Comments: Accepted as workshop paper at DeLTa workshop, ICLR 2025. arXiv admin note: substantial text overlap with arXiv:2405.20971

  3. arXiv:2503.06337

    cs.LG

    Pretraining Generative Flow Networks with Inexpensive Rewards for Molecular Graph Generation

    Authors: Mohit Pandey, Gopeshh Subbaraj, Artem Cherkasov, Martin Ester, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets) have recently emerged as a suitable framework for generating diverse and high-quality molecular structures by learning from rewards treated as unnormalized distributions. Previous works in this framework often restrict exploration by using predefined molecular fragments as building blocks, limiting the chemical space that can be accessed. In this work, we intro…

    Submitted 12 June, 2025; v1 submitted 8 March, 2025; originally announced March 2025.

    Comments: arXiv admin note: text overlap with arXiv:2409.09702

  4. arXiv:2410.19631

    cs.LG

    Efficient Biological Data Acquisition through Inference Set Design

    Authors: Ihor Neporozhnii, Julien Roy, Emmanuel Bengio, Jason Hartford

    Abstract: In drug discovery, highly automated high-throughput laboratories are used to screen a large number of compounds in search of effective drugs. These experiments are expensive, so one might hope to reduce their cost by only experimenting on a subset of the compounds, and predicting the outcomes of the remaining experiments. In this work, we model this scenario as a sequential subset selection proble…

    Submitted 11 April, 2025; v1 submitted 25 October, 2024; originally announced October 2024.

  5. arXiv:2410.15184

    cs.LG cs.AI

    Action abstractions for amortized sampling

    Authors: Oussama Boussif, Léna Néhale Ezzine, Joseph D Viviano, Michał Koziarski, Moksh Jain, Nikolay Malkin, Emmanuel Bengio, Rim Assouel, Yoshua Bengio

    Abstract: As trajectories sampled by policies used by reinforcement learning (RL) and generative flow networks (GFlowNets) grow longer, credit assignment and exploration become more challenging, and the long planning horizon hinders mode discovery and generalization. The challenge is particularly pronounced in entropy-seeking RL methods, such as generative flow networks, where the agent must learn to sample…

    Submitted 19 October, 2024; originally announced October 2024.

  6. arXiv:2410.04461

    cs.LG q-bio.BM

    Improved Off-policy Reinforcement Learning in Biological Sequence Design

    Authors: Hyeonah Kim, Minsu Kim, Taeyoung Yun, Sanghyeok Choi, Emmanuel Bengio, Alex Hernández-García, Jinkyoo Park

    Abstract: Designing biological sequences with desired properties is challenging due to vast search spaces and limited evaluation budgets. Although reinforcement learning methods use proxy models for rapid reward evaluation, insufficient training data can cause proxy misspecification on out-of-distribution inputs. To address this, we propose a novel off-policy search, $δ$-Conservative Search, that enhances r…

    Submitted 16 June, 2025; v1 submitted 6 October, 2024; originally announced October 2024.

    Comments: ICML 2025

  7. arXiv:2410.01432

    cs.LG stat.ML

    Adaptive teachers for amortized samplers

    Authors: Minsu Kim, Sanghyeok Choi, Taeyoung Yun, Emmanuel Bengio, Leo Feng, Jarrid Rector-Brooks, Sungsoo Ahn, Jinkyoo Park, Nikolay Malkin, Yoshua Bengio

    Abstract: Amortized inference is the task of training a parametric model, such as a neural network, to approximate a distribution with a given unnormalized density where exact sampling is intractable. When sampling is implemented as a sequential decision-making process, reinforcement learning (RL) methods, such as generative flow networks, can be used to train the sampling policy. Off-policy RL training fac…

    Submitted 14 April, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: ICLR 2025, 27 pages, 12 figures

  8. arXiv:2409.09702

    cs.LG cs.AI q-bio.BM

    GFlowNet Pretraining with Inexpensive Rewards

    Authors: Mohit Pandey, Gopeshh Subbaraj, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets), a class of generative models, have recently emerged as a suitable framework for generating diverse and high-quality molecular structures by learning from unnormalized reward distributions. Previous works in this direction often restrict exploration by using predefined molecular fragments as building blocks, limiting the chemical space that can be accessed. In t…

    Submitted 15 September, 2024; originally announced September 2024.

  9. arXiv:2406.05426

    cs.LG

    Baking Symmetry into GFlowNets

    Authors: George Ma, Emmanuel Bengio, Yoshua Bengio, Dinghuai Zhang

    Abstract: GFlowNets have exhibited promising performance in generating diverse candidates with high rewards. These networks generate objects incrementally and aim to learn a policy that assigns probability of sampling objects in proportion to rewards. However, the current training pipelines of GFlowNets do not consider the presence of isomorphic actions, which are actions resulting in symmetric or isomorphi…

    Submitted 8 June, 2024; originally announced June 2024.

  10. arXiv:2406.02213

    cs.LG

    Random Policy Evaluation Uncovers Policies of Generative Flow Networks

    Authors: Haoran He, Emmanuel Bengio, Qingpeng Cai, Ling Pan

    Abstract: The Generative Flow Network (GFlowNet) is a probabilistic framework in which an agent learns a stochastic policy and flow functions to sample objects proportionally to an unnormalized reward function. A number of recent works explored connections between GFlowNets and maximum entropy (MaxEnt) RL, which modifies the standard objective of RL agents by learning an entropy-regularized objective. Howev…

    Submitted 2 June, 2025; v1 submitted 4 June, 2024; originally announced June 2024.

  11. arXiv:2405.20971

    cs.LG cs.CV

    Amortizing intractable inference in diffusion models for vision, language, and control

    Authors: Siddarth Venkatraman, Moksh Jain, Luca Scimeca, Minsu Kim, Marcin Sendera, Mohsin Hasan, Luke Rowe, Sarthak Mittal, Pablo Lemos, Emmanuel Bengio, Alexandre Adam, Jarrid Rector-Brooks, Yoshua Bengio, Glen Berseth, Nikolay Malkin

    Abstract: Diffusion models have emerged as effective distribution estimators in vision, language, and reinforcement learning, but their use as priors in downstream tasks poses an intractable posterior inference problem. This paper studies amortized sampling of the posterior over data, $\mathbf{x}\sim p^{\rm post}(\mathbf{x})\propto p(\mathbf{x})r(\mathbf{x})$, in a model that consists of a diffusion generat…

    Submitted 13 January, 2025; v1 submitted 31 May, 2024; originally announced May 2024.

    Comments: NeurIPS 2024; code: https://github.com/GFNOrg/diffusion-finetuning

  12. arXiv:2405.01616

    q-bio.BM cs.AI cs.LG

    Generative Active Learning for the Search of Small-molecule Protein Binders

    Authors: Maksym Korablyov, Cheng-Hao Liu, Moksh Jain, Almer M. van der Sloot, Eric Jolicoeur, Edward Ruediger, Andrei Cristian Nica, Emmanuel Bengio, Kostiantyn Lapchevskyi, Daniel St-Cyr, Doris Alexandra Schuetz, Victor Ion Butoi, Jarrid Rector-Brooks, Simon Blackburn, Leo Feng, Hadi Nekoei, SaiKrishna Gottipati, Priyesh Vijayan, Prateek Gupta, Ladislav Rampášek, Sasikanth Avancha, Pierre-Luc Bacon, William L. Hamilton, Brooks Paige, Sanchit Misra, et al. (9 additional authors not shown)

    Abstract: Despite substantial progress in machine learning for scientific discovery in recent years, truly de novo design of small molecules which exhibit a property of interest remains a significant challenge. We introduce LambdaZero, a generative active learning approach to search for synthesizable molecules. Powered by deep reinforcement learning, LambdaZero learns to search over the vast space of molecu…

    Submitted 2 May, 2024; originally announced May 2024.

  13. arXiv:2405.01155

    cs.LG q-bio.BM

    SynFlowNet: Design of Diverse and Novel Molecules with Synthesis Constraints

    Authors: Miruna Cretu, Charles Harris, Ilia Igashov, Arne Schneuing, Marwin Segler, Bruno Correia, Julien Roy, Emmanuel Bengio, Pietro Liò

    Abstract: Generative models see increasing use in computer-aided drug design. However, while performing well at capturing distributions of molecular motifs, they often produce synthetically inaccessible molecules. To address this, we introduce SynFlowNet, a GFlowNet model whose action space uses chemical reactions and purchasable reactants to sequentially build new molecules. By incorporating forward synthe…

    Submitted 9 April, 2025; v1 submitted 2 May, 2024; originally announced May 2024.

    Journal ref: ICLR 2025: https://openreview.net/forum?id=uvHmnahyp1

  14. arXiv:2402.05309

    cs.LG

    Investigating Generalization Behaviours of Generative Flow Networks

    Authors: Lazar Atanackovic, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets, GFNs) are a generative framework for learning unnormalized probability mass functions over discrete spaces. Since their inception, GFlowNets have proven to be useful for learning generative models in applications where the majority of the discrete space is unvisited during training. This has inspired some to hypothesize that GFlowNets, when paired with deep neu…

    Submitted 16 April, 2025; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted to TMLR

  15. arXiv:2402.05234

    cs.LG

    QGFN: Controllable Greediness with Action Values

    Authors: Elaine Lau, Stephen Zhewen Lu, Ling Pan, Doina Precup, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets; GFNs) are a family of energy-based generative methods for combinatorial objects, capable of generating diverse and high-utility samples. However, consistently biasing GFNs towards producing high-utility samples is non-trivial. In this work, we leverage connections between GFNs and reinforcement learning (RL) and propose to combine the GFN policy with an action-…

    Submitted 1 November, 2024; v1 submitted 7 February, 2024; originally announced February 2024.

    Comments: Accepted by 38th Conference on Neural Information Processing Systems (NeurIPS 2024)

  16. arXiv:2312.14331

    cs.LG

    Maximum entropy GFlowNets with soft Q-learning

    Authors: Sobhan Mohammadpour, Emmanuel Bengio, Emma Frejinger, Pierre-Luc Bacon

    Abstract: Generative Flow Networks (GFNs) have emerged as a powerful tool for sampling discrete objects from unnormalized distributions, offering a scalable alternative to Markov Chain Monte Carlo (MCMC) methods. While GFNs draw inspiration from maximum entropy reinforcement learning (RL), the connection between the two has largely been unclear and seemingly applicable only in specific cases. This paper add…

    Submitted 2 May, 2024; v1 submitted 21 December, 2023; originally announced December 2023.

    Journal ref: 2024 Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, in Proceedings of Machine Learning Research 238:2593-2601

  17. arXiv:2310.19685

    cs.LG q-bio.BM

    DGFN: Double Generative Flow Networks

    Authors: Elaine Lau, Nikhil Vemgal, Doina Precup, Emmanuel Bengio

    Abstract: Deep learning is emerging as an effective tool in drug discovery, with potential applications in both predictive and generative models. Generative Flow Networks (GFlowNets/GFNs) are a recently introduced method recognized for the ability to generate diverse candidates, in particular in small molecule generation tasks. In this work, we introduce double GFlowNets (DGFNs). Drawing inspiration from re…

    Submitted 6 November, 2023; v1 submitted 30 October, 2023; originally announced October 2023.

    Comments: Accepted to NeurIPS 2023 Workshop

  18. arXiv:2310.02823

    cs.LG stat.ML

    Learning to Scale Logits for Temperature-Conditional GFlowNets

    Authors: Minsu Kim, Joohwan Ko, Taeyoung Yun, Dinghuai Zhang, Ling Pan, Woochang Kim, Jinkyoo Park, Emmanuel Bengio, Yoshua Bengio

    Abstract: GFlowNets are probabilistic models that sequentially generate compositional structures through a stochastic policy. Among GFlowNets, temperature-conditional GFlowNets can introduce temperature-based controllability for exploration and exploitation. We propose Logit-scaling GFlowNets (Logit-GFN), a novel architectural design that greatly accelerates the training of temperature-conditional…

    Submitted 2 June, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICML 2024, 23 pages, 21 figures

  19. arXiv:2310.02710

    cs.LG stat.ML

    Local Search GFlowNets

    Authors: Minsu Kim, Taeyoung Yun, Emmanuel Bengio, Dinghuai Zhang, Yoshua Bengio, Sungsoo Ahn, Jinkyoo Park

    Abstract: Generative Flow Networks (GFlowNets) are amortized sampling methods that learn a distribution over discrete objects proportional to their rewards. GFlowNets exhibit a remarkable ability to generate diverse samples, yet occasionally struggle to consistently produce samples with high rewards due to over-exploration on wide sample space. This paper proposes to train GFlowNets with local search, which…

    Submitted 22 March, 2024; v1 submitted 4 October, 2023; originally announced October 2023.

    Comments: ICLR 2024 (Spotlight paper), 18 pages, 17 figures

  20. arXiv:2306.04620

    cs.LG q-bio.BM

    Goal-conditioned GFlowNets for Controllable Multi-Objective Molecular Design

    Authors: Julien Roy, Pierre-Luc Bacon, Christopher Pal, Emmanuel Bengio

    Abstract: In recent years, in-silico molecular design has received much attention from the machine learning community. When designing a new compound for pharmaceutical applications, there are usually multiple properties of such molecules that need to be optimised: binding energy to the target, synthesizability, toxicity, EC50, and so on. While previous approaches have employed a scalarization scheme to turn…

    Submitted 29 June, 2023; v1 submitted 7 June, 2023; originally announced June 2023.

    Comments: 14 pages

  21. arXiv:2305.07170

    cs.LG

    Towards Understanding and Improving GFlowNet Training

    Authors: Max W. Shen, Emmanuel Bengio, Ehsan Hajiramezanali, Andreas Loukas, Kyunghyun Cho, Tommaso Biancalani

    Abstract: Generative flow networks (GFlowNets) are a family of algorithms that learn a generative policy to sample discrete objects $x$ with non-negative reward $R(x)$. Learning objectives guarantee the GFlowNet samples $x$ from the target distribution $p^*(x) \propto R(x)$ when loss is globally minimized over all states or trajectories, but it is unclear how well they perform with practical limits on train…

    Submitted 11 May, 2023; originally announced May 2023.

    Comments: Accepted to ICML 2023

  22. arXiv:2210.12765

    cs.LG stat.ML

    Multi-Objective GFlowNets

    Authors: Moksh Jain, Sharath Chandra Raparthy, Alex Hernandez-Garcia, Jarrid Rector-Brooks, Yoshua Bengio, Santiago Miret, Emmanuel Bengio

    Abstract: We study the problem of generating diverse candidates in the context of Multi-Objective Optimization. In many applications of machine learning such as drug discovery and material design, the goal is to generate candidates which simultaneously optimize a set of potentially conflicting objectives. Moreover, these objectives are often imperfect evaluations of some underlying property of interest, mak…

    Submitted 17 July, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

    Comments: 23 pages, 8 figures. ICML 2023. Code at: https://github.com/GFNOrg/multi-objective-gfn

  23. arXiv:2209.12782

    cs.LG stat.ML

    Learning GFlowNets from partial episodes for improved convergence and stability

    Authors: Kanika Madan, Jarrid Rector-Brooks, Maksym Korablyov, Emmanuel Bengio, Moksh Jain, Andrei Nica, Tom Bosc, Yoshua Bengio, Nikolay Malkin

    Abstract: Generative flow networks (GFlowNets) are a family of algorithms for training a sequential sampler of discrete objects under an unnormalized target density and have been successfully used for various probabilistic modeling tasks. Existing training objectives for GFlowNets are either local to states or transitions, or propagate a reward signal over an entire sampling trajectory. We argue that these…

    Submitted 3 June, 2023; v1 submitted 26 September, 2022; originally announced September 2022.

    Comments: ICML 2023

  24. arXiv:2203.04115

    q-bio.BM cs.LG

    Biological Sequence Design with GFlowNets

    Authors: Moksh Jain, Emmanuel Bengio, Alex Hernandez-Garcia, Jarrid Rector-Brooks, Bonaventure F. P. Dossou, Chanakya Ekbote, Jie Fu, Tianyu Zhang, Micheal Kilgour, Dinghuai Zhang, Lena Simine, Payel Das, Yoshua Bengio

    Abstract: Design of de novo biological sequences with desired properties, like protein and DNA sequences, often involves an active loop with several rounds of molecule ideation and expensive wet-lab evaluations. These experiments can consist of multiple stages, with increasing levels of precision and cost of evaluation, where candidates are filtered. This makes the diversity of proposed candidates a key con…

    Submitted 24 May, 2023; v1 submitted 2 March, 2022; originally announced March 2022.

    Comments: ICML 2022. 15 pages, 3 figures. Code available at: https://github.com/MJ10/BioSeq-GFN-AL. Updated GFP results

  25. arXiv:2201.13259

    cs.LG stat.ML

    Trajectory balance: Improved credit assignment in GFlowNets

    Authors: Nikolay Malkin, Moksh Jain, Emmanuel Bengio, Chen Sun, Yoshua Bengio

    Abstract: Generative flow networks (GFlowNets) are a method for learning a stochastic policy for generating compositional objects, such as graphs or strings, from a given unnormalized density by sequences of actions, where many possible action sequences may lead to the same object. We find previously proposed learning objectives for GFlowNets, flow matching and detailed balance, which are analogous to tempo…

    Submitted 4 October, 2023; v1 submitted 31 January, 2022; originally announced January 2022.

    Comments: NeurIPS 2022; see footnotes for code; v3 fixes minor errata

  26. arXiv:2111.09266

    cs.LG cs.AI stat.ML

    GFlowNet Foundations

    Authors: Yoshua Bengio, Salem Lahlou, Tristan Deleu, Edward J. Hu, Mo Tiwari, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets) have been introduced as a method to sample a diverse set of candidates in an active learning context, with a training objective that makes them approximately sample in proportion to a given reward function. In this paper, we show a number of additional theoretical properties of GFlowNets. They can be used to estimate joint probability distributions and the corr…

    Submitted 10 July, 2023; v1 submitted 17 November, 2021; originally announced November 2021.

  27. arXiv:2106.04399

    cs.LG

    Flow Network based Generative Models for Non-Iterative Diverse Candidate Generation

    Authors: Emmanuel Bengio, Moksh Jain, Maksym Korablyov, Doina Precup, Yoshua Bengio

    Abstract: This paper is about the problem of learning a stochastic policy for generating an object (like a molecular graph) from a sequence of actions, such that the probability of generating an object is proportional to a given positive reward for that object. Whereas standard return maximization tends to converge to a single return-maximizing sequence, there are cases where we would like to sample a diver…

    Submitted 19 November, 2021; v1 submitted 8 June, 2021; originally announced June 2021.

    Comments: Accepted at NeurIPS 2021

  28. arXiv:2106.03955

    cs.LG stat.ML

    Correcting Momentum in Temporal Difference Learning

    Authors: Emmanuel Bengio, Joelle Pineau, Doina Precup

    Abstract: A common optimization tool used in deep reinforcement learning is momentum, which consists in accumulating and discounting past gradients, reapplying them at each iteration. We argue that, unlike in supervised learning, momentum in Temporal Difference (TD) learning accumulates gradients that become doubly stale: not only does the gradient of the loss change due to parameter updates, the loss itsel…

    Submitted 7 June, 2021; originally announced June 2021.

    Comments: NeurIPS Deep RL Workshop 2020

  29. arXiv:2007.02786

    cs.LG stat.ML

    TDprop: Does Jacobi Preconditioning Help Temporal Difference Learning?

    Authors: Joshua Romoff, Peter Henderson, David Kanaa, Emmanuel Bengio, Ahmed Touati, Pierre-Luc Bacon, Joelle Pineau

    Abstract: We investigate whether Jacobi preconditioning, accounting for the bootstrap term in temporal difference (TD) learning, can help boost performance of adaptive optimizers. Our method, TDprop, computes a per parameter learning rate based on the diagonal preconditioning of the TD update rule. We show how this can be used in both $n$-step returns and TD($λ$). Our theoretical findings demonstrate that i…

    Submitted 6 July, 2020; originally announced July 2020.

    Comments: Presented at the Theoretical Foundations of Reinforcement Learning workshop at ICML 2020

  30. arXiv:2003.06350

    cs.LG stat.ML

    Interference and Generalization in Temporal Difference Learning

    Authors: Emmanuel Bengio, Joelle Pineau, Doina Precup

    Abstract: We study the link between generalization and interference in temporal-difference (TD) learning. Interference is defined as the inner product of two different gradients, representing their alignment. This quantity emerges as being of interest from a variety of observations about neural networks, parameter sharing and the dynamics of learning. We find that TD easily leads to low-interference, under-…

    Submitted 13 March, 2020; originally announced March 2020.

    Comments: Submitted to ICML 2020. 20 pages, 14 figures

  31. arXiv:1807.04270

    physics.bio-ph cs.LG stat.ML

    Attack and defence in cellular decision-making: lessons from machine learning

    Authors: Thomas J. Rademaker, Emmanuel Bengio, Paul François

    Abstract: Machine learning algorithms can be fooled by small well-designed adversarial perturbations. This is reminiscent of cellular decision-making where ligands (called antagonists) prevent correct signalling, like in early immune recognition. We draw a formal analogy between neural networks used in machine learning and models of cellular decision-making (adaptive proofreading). We apply attacks from mac…

    Submitted 13 June, 2019; v1 submitted 10 July, 2018; originally announced July 2018.

    Journal ref: Phys. Rev. X 9, 031012 (2019)

  32. arXiv:1802.09484

    stat.ML cs.LG

    Disentangling the independently controllable factors of variation by interacting with the world

    Authors: Valentin Thomas, Emmanuel Bengio, William Fedus, Jules Pondard, Philippe Beaudoin, Hugo Larochelle, Joelle Pineau, Doina Precup, Yoshua Bengio

    Abstract: It has been postulated that a good representation is one that disentangles the underlying explanatory factors of variation. However, it remains an open question what kind of training framework could potentially achieve that. Whereas most previous work focuses on the static setting (e.g., with images), we postulate that some of the causal factors could be discovered if the learner is allowed to int…

    Submitted 26 February, 2018; originally announced February 2018.

    Comments: Presented at NIPS 2017 Learning Disentangling Representations Workshop

  33. arXiv:1708.01289

    cs.LG cs.AI stat.ML

    Independently Controllable Factors

    Authors: Valentin Thomas, Jules Pondard, Emmanuel Bengio, Marc Sarfati, Philippe Beaudoin, Marie-Jean Meurs, Joelle Pineau, Doina Precup, Yoshua Bengio

    Abstract: It has been postulated that a good representation is one that disentangles the underlying explanatory factors of variation. However, it remains an open question what kind of training framework could potentially achieve that. Whereas most previous work focuses on the static setting (e.g., with images), we postulate that some of the causal factors could be discovered if the learner is allowed to int…

    Submitted 25 August, 2017; v1 submitted 3 August, 2017; originally announced August 2017.

  34. arXiv:1706.05394

    stat.ML cs.LG

    A Closer Look at Memorization in Deep Networks

    Authors: Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, David Krueger, Emmanuel Bengio, Maxinder S. Kanwal, Tegan Maharaj, Asja Fischer, Aaron Courville, Yoshua Bengio, Simon Lacoste-Julien

    Abstract: We examine the role of memorization in deep learning, drawing connections to capacity, generalization, and adversarial robustness. While deep networks are capable of memorizing noise data, our results suggest that they tend to prioritize learning simple patterns first. In our experiments, we expose qualitative differences in gradient-based optimization of deep neural networks (DNNs) on noise vs. r…

    Submitted 1 July, 2017; v1 submitted 16 June, 2017; originally announced June 2017.

    Comments: Appears in Proceedings of the 34th International Conference on Machine Learning (ICML 2017), Devansh Arpit, Stanisław Jastrzębski, Nicolas Ballas, and David Krueger contributed equally to this work

  35. arXiv:1703.07718

    cs.LG

    Independently Controllable Features

    Authors: Emmanuel Bengio, Valentin Thomas, Joelle Pineau, Doina Precup, Yoshua Bengio

    Abstract: Finding features that disentangle the different causes of variation in real data is a difficult task, that has nonetheless received considerable attention in static domains like natural images. Interactive environments, in which an agent can deliberately take actions, offer an opportunity to tackle this task better, because the agent can experiment with different actions and observe their effects.…

    Submitted 22 March, 2017; originally announced March 2017.

    Comments: RLDM submission

  36. arXiv:1511.06297

    cs.LG

    Conditional Computation in Neural Networks for faster models

    Authors: Emmanuel Bengio, Pierre-Luc Bacon, Joelle Pineau, Doina Precup

    Abstract: Deep learning has become the state-of-the-art tool in many applications, but the evaluation and training of deep models can be time-consuming and computationally expensive. The conditional computation approach has been proposed to tackle this problem (Bengio et al., 2013; Davis & Arel, 2013). It operates by selectively activating only parts of the network at a time. In this paper, we use reinforcement…

    Submitted 7 January, 2016; v1 submitted 19 November, 2015; originally announced November 2015.

    Comments: ICLR 2016 submission, revised