Skip to main content

Showing 1–12 of 12 results for author: Jiralerspong, T

.
  1. arXiv:2502.10236  [pdf, other

    cs.LG cs.AI

    Shaping Inductive Bias in Diffusion Models through Frequency-Based Noise Control

    Authors: Thomas Jiralerspong, Berton Earnshaw, Jason Hartford, Yoshua Bengio, Luca Scimeca

    Abstract: Diffusion Probabilistic Models (DPMs) are powerful generative models that have achieved unparalleled success in a number of generative tasks. In this work, we aim to build inductive biases into the training and sampling of diffusion models to better accommodate the target distribution of the data to model. For topologically structured data, we devise a frequency-based noising operator to purposefu… ▽ More

    Submitted 12 March, 2025; v1 submitted 14 February, 2025; originally announced February 2025.

    Comments: Published as workshop paper at DeLTa and FPI workshops, ICLR 2025

  2. arXiv:2410.20647  [pdf, other

    cs.LG stat.ML

    General Causal Imputation via Synthetic Interventions

    Authors: Marco Jiralerspong, Thomas Jiralerspong, Vedant Shah, Dhanya Sridhar, Gauthier Gidel

    Abstract: Given two sets of elements (such as cell types and drug compounds), researchers typically only have access to a limited subset of their interactions. The task of causal imputation involves using this subset to predict unobserved interactions. Squires et al. (2022) have proposed two estimators for this task based on the synthetic interventions (SI) estimator: SI-A (for actions) and SI-C (for contex… ▽ More

    Submitted 27 October, 2024; originally announced October 2024.

  3. arXiv:2410.14817  [pdf, ps, other

    cs.CL cs.AI cs.LG

    A Complexity-Based Theory of Compositionality

    Authors: Eric Elmoznino, Thomas Jiralerspong, Yoshua Bengio, Guillaume Lajoie

    Abstract: Compositionality is believed to be fundamental to intelligence. In humans, it underlies the structure of thought, language, and higher-level reasoning. In AI, compositional representations can enable a powerful form of out-of-distribution generalization, in which a model systematically adapts to novel combinations of known concepts. However, while we have strong intuitions about what compositional… ▽ More

    Submitted 2 June, 2025; v1 submitted 18 October, 2024; originally announced October 2024.

  4. arXiv:2410.01444  [pdf, ps, other

    cs.CL cs.AI cs.IT cs.LG

    Geometric Signatures of Compositionality Across a Language Model's Lifetime

    Authors: Jin Hwa Lee, Thomas Jiralerspong, Lei Yu, Yoshua Bengio, Emily Cheng

    Abstract: By virtue of linguistic compositionality, few syntactic rules and a finite lexicon can generate an unbounded number of sentences. That is, language, though seemingly high-dimensional, can be explained using relatively few degrees of freedom. An open question is whether contemporary language models (LMs) reflect the intrinsic simplicity of language that is enabled by compositionality. We take a geo… ▽ More

    Submitted 3 June, 2025; v1 submitted 2 October, 2024; originally announced October 2024.

    Comments: Under review at ARR

  5. arXiv:2407.00957  [pdf, other

    cs.NE q-bio.NC stat.ML

    Expressivity of Neural Networks with Random Weights and Learned Biases

    Authors: Ezekiel Williams, Alexandre Payeur, Avery Hee-Woon Ryoo, Thomas Jiralerspong, Matthew G. Perich, Luca Mazzucato, Guillaume Lajoie

    Abstract: Landmark universal function approximation results for neural networks with trained weights and biases provided the impetus for the ubiquitous use of neural networks as learning models in neuroscience and Artificial Intelligence (AI). Recent work has extended these results to networks in which a smaller subset of weights (e.g., output weights) are tuned, leaving other parameters random. However, it… ▽ More

    Submitted 21 March, 2025; v1 submitted 1 July, 2024; originally announced July 2024.

    Comments: upload of camera-ready manuscript accepted as poster at ICLR 2025; change of author order

  6. arXiv:2402.01207  [pdf, other

    cs.LG cs.AI stat.ME

    Efficient Causal Graph Discovery Using Large Language Models

    Authors: Thomas Jiralerspong, Xiaoyin Chen, Yash More, Vedant Shah, Yoshua Bengio

    Abstract: We propose a novel framework that leverages LLMs for full causal graph discovery. While previous LLM-based methods have used a pairwise query approach, this requires a quadratic number of queries which quickly becomes impractical for larger causal graphs. In contrast, the proposed framework uses a breadth-first search (BFS) approach which allows it to use only a linear number of queries. We also s… ▽ More

    Submitted 20 July, 2024; v1 submitted 2 February, 2024; originally announced February 2024.

  7. arXiv:2310.10693  [pdf, other

    cs.SI cs.AI cs.LG

    Network Analysis of the iNaturalist Citizen Science Community

    Authors: Yu Lu Liu, Thomas Jiralerspong

    Abstract: In recent years, citizen science has become a larger and larger part of the scientific community. Its ability to crowd source data and expertise from thousands of citizen scientists makes it invaluable. Despite the field's growing popularity, the interactions and structure of citizen science projects are still poorly understood and under analyzed. We use the iNaturalist citizen science platform as… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

  8. arXiv:2310.09997  [pdf, other

    cs.AI cs.LG eess.SY

    Forecaster: Towards Temporally Abstract Tree-Search Planning from Pixels

    Authors: Thomas Jiralerspong, Flemming Kondrup, Doina Precup, Khimya Khetarpal

    Abstract: The ability to plan at many different levels of abstraction enables agents to envision the long-term repercussions of their decisions and thus enables sample-efficient learning. This becomes particularly beneficial in complex environments from high-dimensional state space such as pixels, where the goal is distant and the reward sparse. We introduce Forecaster, a deep hierarchical reinforcement lea… ▽ More

    Submitted 15 October, 2023; originally announced October 2023.

  9. arXiv:2310.02423  [pdf, other

    cs.LG stat.ML

    Delta-AI: Local objectives for amortized inference in sparse graphical models

    Authors: Jean-Pierre Falet, Hae Beom Lee, Nikolay Malkin, Chen Sun, Dragos Secrieru, Thomas Jiralerspong, Dinghuai Zhang, Guillaume Lajoie, Yoshua Bengio

    Abstract: We present a new algorithm for amortized inference in sparse probabilistic graphical models (PGMs), which we call $Δ$-amortized inference ($Δ$-AI). Our approach is based on the observation that when the sampling of variables in a PGM is seen as a sequence of actions taken by an agent, sparsity of the PGM enables local credit assignment in the agent's policy learning objective. This yields a local… ▽ More

    Submitted 13 March, 2024; v1 submitted 3 October, 2023; originally announced October 2023.

    Comments: ICLR 2024; 19 pages, code: https://github.com/GFNOrg/Delta-AI/

  10. arXiv:2308.05711  [pdf, other

    cs.LG eess.SY

    A Comparison of Classical and Deep Reinforcement Learning Methods for HVAC Control

    Authors: Marshall Wang, John Willes, Thomas Jiralerspong, Matin Moezzi

    Abstract: Reinforcement learning (RL) is a promising approach for optimizing HVAC control. RL offers a framework for improving system performance, reducing energy consumption, and enhancing cost efficiency. We benchmark two popular classical and deep RL methods (Q-Learning and Deep-Q-Networks) across multiple HVAC environments and explore the practical consideration of model hyper-parameter selection and re… ▽ More

    Submitted 10 August, 2023; originally announced August 2023.

  11. arXiv:2210.05845  [pdf, other

    cs.LG cs.AI

    Contrastive Retrospection: honing in on critical steps for rapid learning and generalization in RL

    Authors: Chen Sun, Wannan Yang, Thomas Jiralerspong, Dane Malenfant, Benjamin Alsbury-Nealy, Yoshua Bengio, Blake Richards

    Abstract: In real life, success is often contingent upon multiple critical steps that are distant in time from each other and from the final reward. These critical steps are challenging to identify with traditional reinforcement learning (RL) methods that rely on the Bellman equation for credit assignment. Here, we present a new RL algorithm that uses offline contrastive learning to hone in on these critica… ▽ More

    Submitted 27 October, 2023; v1 submitted 11 October, 2022; originally announced October 2022.

  12. arXiv:2210.02552  [pdf, other

    cs.LG

    Towards Safe Mechanical Ventilation Treatment Using Deep Offline Reinforcement Learning

    Authors: Flemming Kondrup, Thomas Jiralerspong, Elaine Lau, Nathan de Lara, Jacob Shkrob, My Duc Tran, Doina Precup, Sumana Basu

    Abstract: Mechanical ventilation is a key form of life support for patients with pulmonary impairment. Healthcare workers are required to continuously adjust ventilator settings for each patient, a challenging and time consuming task. Hence, it would be beneficial to develop an automated decision support tool to optimize ventilation treatment. We present DeepVent, a Conservative Q-Learning (CQL) based offli… ▽ More

    Submitted 5 October, 2022; originally announced October 2022.

    Comments: to be published in IAAI (Innovative Applications of Artificial Intelligence) 2023