
Showing 1–22 of 22 results for author: Lahlou, S

  1. arXiv:2506.02515

    cs.CL cs.AI cs.LG

    FinChain: A Symbolic Benchmark for Verifiable Chain-of-Thought Financial Reasoning

    Authors: Zhuohan Xie, Dhruv Sahnan, Debopriyo Banerjee, Georgi Georgiev, Rushil Thareja, Hachem Madmoun, Jinyan Su, Aaryamonvikram Singh, Yuxia Wang, Rui Xing, Fajri Koto, Haonan Li, Ivan Koychev, Tanmoy Chakraborty, Salem Lahlou, Veselin Stoyanov, Preslav Nakov

    Abstract: Multi-step symbolic reasoning is critical for advancing downstream performance on financial tasks. Yet, benchmarks for systematically evaluating this capability are lacking. Existing datasets like FinQA and ConvFinQA supervise only final numerical answers, without assessing intermediate reasoning steps. To address this, we introduce FinChain, the first symbolic benchmark designed for verifiable Ch…

    Submitted 3 June, 2025; originally announced June 2025.

    Comments: 15 pages, 8 figures, 2 tables

  2. arXiv:2505.21887

    cs.AI cs.CE cs.LG

    SVRPBench: A Realistic Benchmark for Stochastic Vehicle Routing Problem

    Authors: Ahmed Heakl, Yahia Salaheldin Shaaban, Martin Takac, Salem Lahlou, Zangir Iklassov

    Abstract: Robust routing under uncertainty is central to real-world logistics, yet most benchmarks assume static, idealized settings. We present SVRPBench, the first open benchmark to capture high-fidelity stochastic dynamics in vehicle routing at urban scale. Spanning more than 500 instances with up to 1000 customers, it simulates realistic delivery conditions: time-dependent congestion, log-normal delays,…

    Submitted 29 May, 2025; v1 submitted 27 May, 2025; originally announced May 2025.

    Comments: 18 pages, 14 figures, 11 tables

  3. arXiv:2505.20964

    cs.LG cs.IT

    Semantic Communication meets System 2 ML: How Abstraction, Compositionality and Emergent Languages Shape Intelligence

    Authors: Mehdi Bennis, Salem Lahlou

    Abstract: The trajectories of 6G and AI are set for a creative collision. However, current visions for 6G remain largely incremental evolutions of 5G, while progress in AI is hampered by brittle, data-hungry models that lack robust reasoning capabilities. This paper argues for a foundational paradigm shift, moving beyond the purely technical level of communication toward systems capable of semantic understa…

    Submitted 27 May, 2025; originally announced May 2025.

  4. arXiv:2505.15251

    cs.LG

    Loss-Guided Auxiliary Agents for Overcoming Mode Collapse in GFlowNets

    Authors: Idriss Malek, Abhijit Sharma, Salem Lahlou

    Abstract: Although Generative Flow Networks (GFlowNets) are designed to capture multiple modes of a reward function, they often suffer from mode collapse in practice, getting trapped in early discovered modes and requiring prolonged training to find diverse solutions. Existing exploration techniques may rely on heuristic novelty signals. We propose Loss-Guided GFlowNets (LGGFN), a novel approach where an au…

    Submitted 21 May, 2025; originally announced May 2025.

  5. arXiv:2505.12135

    cs.AI cs.CL

    LLM-BABYBENCH: Understanding and Evaluating Grounded Planning and Reasoning in LLMs

    Authors: Omar Choukrani, Idriss Malek, Daniil Orel, Zhuohan Xie, Zangir Iklassov, Martin Takáč, Salem Lahlou

    Abstract: Assessing the capacity of Large Language Models (LLMs) to plan and reason within the constraints of interactive environments is crucial for developing capable AI agents. We introduce $\textbf{LLM-BabyBench}$, a new benchmark suite designed specifically for this purpose. Built upon a textual adaptation of the procedurally generated BabyAI grid world, this suite evaluates LLMs on three fundamental a…

    Submitted 17 May, 2025; originally announced May 2025.

  6. arXiv:2504.19990

    cs.CY cs.AI

    Mitigating Societal Cognitive Overload in the Age of AI: Challenges and Directions

    Authors: Salem Lahlou

    Abstract: Societal cognitive overload, driven by the deluge of information and complexity in the AI age, poses a critical challenge to human well-being and societal resilience. This paper argues that mitigating cognitive overload is not only essential for improving present-day life but also a crucial prerequisite for navigating the potential risks of advanced AI, including existential threats. We examine ho…

    Submitted 28 April, 2025; originally announced April 2025.

  7. arXiv:2504.19981

    cs.LG cs.CL

    Accurate and Diverse LLM Mathematical Reasoning via Automated PRM-Guided GFlowNets

    Authors: Adam Younsi, Abdalgader Abubaker, Mohamed El Amine Seddik, Hakim Hacid, Salem Lahlou

    Abstract: Achieving both accuracy and diverse reasoning remains challenging for Large Language Models (LLMs) in complex domains like mathematics. A key bottleneck is evaluating intermediate reasoning steps to guide generation without costly human annotations. To address this, we first introduce a novel Process Reward Model (PRM) trained automatically using Monte Carlo Tree Search coupled with a similarity-b…

    Submitted 8 May, 2025; v1 submitted 28 April, 2025; originally announced April 2025.

  8. arXiv:2502.13191

    cs.LG cs.AI

    On the Privacy Risks of Spiking Neural Networks: A Membership Inference Analysis

    Authors: Junyi Guan, Abhijith Sharma, Chong Tian, Salem Lahlou

    Abstract: Spiking Neural Networks (SNNs) are increasingly explored for their energy efficiency and robustness in real-world applications, yet their privacy risks remain largely unexamined. In this work, we investigate the susceptibility of SNNs to Membership Inference Attacks (MIAs) -- a major privacy threat where an adversary attempts to determine whether a given sample was part of the training dataset. Wh…

    Submitted 11 June, 2025; v1 submitted 18 February, 2025; originally announced February 2025.

    Comments: 14 pages, 6 figures

  9. arXiv:2407.03105

    cs.LG

    On Generalization for Generative Flow Networks

    Authors: Anas Krichel, Nikolay Malkin, Salem Lahlou, Yoshua Bengio

    Abstract: Generative Flow Networks (GFlowNets) have emerged as an innovative learning paradigm designed to address the challenge of sampling from an unnormalized probability distribution, called the reward function. This framework learns a policy on a constructed graph, which enables sampling from an approximation of the target probability distribution through successive steps of sampling from the learned p…

    Submitted 3 July, 2024; originally announced July 2024.

  10. arXiv:2406.16061

    cs.LG cs.CL

    PORT: Preference Optimization on Reasoning Traces

    Authors: Salem Lahlou, Abdalgader Abubaker, Hakim Hacid

    Abstract: Preference optimization methods have been successfully applied to improve not only the alignment of large language models (LLMs) with human values, but also specific natural language tasks such as summarization and stylistic continuations. This paper proposes using preference optimization methods on Chain-of-Thought steps in order to improve the mathematical reasoning performances of language mode…

    Submitted 4 February, 2025; v1 submitted 23 June, 2024; originally announced June 2024.

  11. arXiv:2404.04291

    cs.LG

    Investigating Regularization of Self-Play Language Models

    Authors: Reda Alami, Abdalgader Abubaker, Mastane Achab, Mohamed El Amine Seddik, Salem Lahlou

    Abstract: This paper explores the effects of various forms of regularization in the context of language model alignment via self-play. While both reinforcement learning from human feedback (RLHF) and direct preference optimization (DPO) require to collect costly human-annotated pairwise preferences, the self-play fine-tuning (SPIN) approach replaces the rejected answers by data generated from the previous i…

    Submitted 4 April, 2024; originally announced April 2024.

  12. arXiv:2306.15058

    cs.LG stat.ML

    BatchGFN: Generative Flow Networks for Batch Active Learning

    Authors: Shreshth A. Malik, Salem Lahlou, Andrew Jesson, Moksh Jain, Nikolay Malkin, Tristan Deleu, Yoshua Bengio, Yarin Gal

    Abstract: We introduce BatchGFN -- a novel approach for pool-based active learning that uses generative flow networks to sample sets of data points proportional to a batch reward. With an appropriate reward function to quantify the utility of acquiring a batch, such as the joint mutual information between the batch and the model parameters, BatchGFN is able to construct highly informative batches for active…

    Submitted 26 June, 2023; originally announced June 2023.

    Comments: Accepted at the Structured Probabilistic Inference & Generative Modeling workshop, ICML 2023

  13. arXiv:2306.13831

    cs.LG

    Minigrid & Miniworld: Modular & Customizable Reinforcement Learning Environments for Goal-Oriented Tasks

    Authors: Maxime Chevalier-Boisvert, Bolun Dai, Mark Towers, Rodrigo de Lazcano, Lucas Willems, Salem Lahlou, Suman Pal, Pablo Samuel Castro, Jordan Terry

    Abstract: We present the Minigrid and Miniworld libraries which provide a suite of goal-oriented 2D and 3D environments. The libraries were explicitly created with a minimalistic design paradigm to allow users to rapidly develop new environments for a wide range of research-specific needs. As a result, both have received widescale adoption by the RL community, facilitating research in a wide range of areas.…

    Submitted 23 June, 2023; originally announced June 2023.

  14. arXiv:2305.14594

    cs.LG

    torchgfn: A PyTorch GFlowNet library

    Authors: Salem Lahlou, Joseph D. Viviano, Victor Schmidt, Yoshua Bengio

    Abstract: The growing popularity of generative flow networks (GFlowNets or GFNs) from a range of researchers with diverse backgrounds and areas of expertise necessitates a library which facilitates the testing of new features such as training losses that can be easily compared to standard benchmark implementations, or on a set of common environments. torchgfn is a PyTorch library that aims to address this n…

    Submitted 29 August, 2023; v1 submitted 23 May, 2023; originally announced May 2023.

  15. arXiv:2301.12594

    cs.LG stat.ML

    A theory of continuous generative flow networks

    Authors: Salem Lahlou, Tristan Deleu, Pablo Lemos, Dinghuai Zhang, Alexandra Volokhova, Alex Hernández-García, Léna Néhale Ezzine, Yoshua Bengio, Nikolay Malkin

    Abstract: Generative flow networks (GFlowNets) are amortized variational inference algorithms that are trained to sample from unnormalized target distributions over compositional objects. A key limitation of GFlowNets until this time has been that they are restricted to discrete spaces. We present a theory for generalized GFlowNets, which encompasses both existing discrete GFlowNets and ones with continuous…

    Submitted 25 May, 2023; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: ICML 2023; 32 pages; code: https://github.com/saleml/continuous-gfn

  16. arXiv:2210.12928

    cs.LG cs.AI

    GFlowOut: Dropout with Generative Flow Networks

    Authors: Dianbo Liu, Moksh Jain, Bonaventure Dossou, Qianli Shen, Salem Lahlou, Anirudh Goyal, Nikolay Malkin, Chris Emezue, Dinghuai Zhang, Nadhir Hassen, Xu Ji, Kenji Kawaguchi, Yoshua Bengio

    Abstract: Bayesian Inference offers principled tools to tackle many critical problems with modern neural networks such as poor calibration and generalization, and data inefficiency. However, scaling Bayesian inference to large architectures is challenging and requires restrictive approximations. Monte Carlo Dropout has been widely used as a relatively cheap way for approximate Inference and to estimate unce…

    Submitted 23 June, 2023; v1 submitted 23 October, 2022; originally announced October 2022.

  17. arXiv:2210.00580

    cs.LG stat.ML

    GFlowNets and variational inference

    Authors: Nikolay Malkin, Salem Lahlou, Tristan Deleu, Xu Ji, Edward Hu, Katie Everett, Dinghuai Zhang, Yoshua Bengio

    Abstract: This paper builds bridges between two families of probabilistic algorithms: (hierarchical) variational inference (VI), which is typically used to model distributions over continuous spaces, and generative flow networks (GFlowNets), which have been used for distributions over discrete structures such as graphs. We demonstrate that, in certain cases, VI algorithms are equivalent to special cases of…

    Submitted 2 March, 2023; v1 submitted 2 October, 2022; originally announced October 2022.

    Comments: ICLR 2023 final version; code: https://github.com/GFNOrg/GFN_vs_HVI

  18. arXiv:2111.09266

    cs.LG cs.AI stat.ML

    GFlowNet Foundations

    Authors: Yoshua Bengio, Salem Lahlou, Tristan Deleu, Edward J. Hu, Mo Tiwari, Emmanuel Bengio

    Abstract: Generative Flow Networks (GFlowNets) have been introduced as a method to sample a diverse set of candidates in an active learning context, with a training objective that makes them approximately sample in proportion to a given reward function. In this paper, we show a number of additional theoretical properties of GFlowNets. They can be used to estimate joint probability distributions and the corr…

    Submitted 10 July, 2023; v1 submitted 17 November, 2021; originally announced November 2021.

  19. arXiv:2102.08501

    cs.LG stat.ML

    DEUP: Direct Epistemic Uncertainty Prediction

    Authors: Salem Lahlou, Moksh Jain, Hadi Nekoei, Victor Ion Butoi, Paul Bertin, Jarrid Rector-Brooks, Maksym Korablyov, Yoshua Bengio

    Abstract: Epistemic Uncertainty is a measure of the lack of knowledge of a learner which diminishes with more evidence. While existing work focuses on using the variance of the Bayesian posterior due to parameter uncertainty as a measure of epistemic uncertainty, we argue that this does not capture the part of lack of knowledge induced by model misspecification. We discuss how the excess risk, which is the…

    Submitted 3 February, 2023; v1 submitted 16 February, 2021; originally announced February 2021.

  20. arXiv:2008.06456

    cs.LG stat.ML

    Mastering Rate based Curriculum Learning

    Authors: Lucas Willems, Salem Lahlou, Yoshua Bengio

    Abstract: Recent automatic curriculum learning algorithms, and in particular Teacher-Student algorithms, rely on the notion of learning progress, making the assumption that the good next tasks are the ones on which the learner is making the fastest progress or digress. In this work, we first propose a simpler and improved version of these algorithms. We then argue that the notion of learning progress itself…

    Submitted 14 August, 2020; originally announced August 2020.

  21. arXiv:1904.04478

    stat.ML cs.LG

    Kernelized Complete Conditional Stein Discrepancy

    Authors: Raghav Singhal, Xintian Han, Saad Lahlou, Rajesh Ranganath

    Abstract: Much of machine learning relies on comparing distributions with discrepancy measures. Stein's method creates discrepancy measures between two distributions that require only the unnormalized density of one and samples from the other. Stein discrepancies can be combined with kernels to define kernelized Stein discrepancies (KSDs). While kernels make Stein discrepancies tractable, they pose several…

    Submitted 17 July, 2020; v1 submitted 9 April, 2019; originally announced April 2019.

  22. arXiv:1810.08272

    cs.AI cs.CL

    BabyAI: A Platform to Study the Sample Efficiency of Grounded Language Learning

    Authors: Maxime Chevalier-Boisvert, Dzmitry Bahdanau, Salem Lahlou, Lucas Willems, Chitwan Saharia, Thien Huu Nguyen, Yoshua Bengio

    Abstract: Allowing humans to interactively train artificial agents to understand language instructions is desirable for both practical and scientific reasons, but given the poor data efficiency of the current learning methods, this goal may require substantial research efforts. Here, we introduce the BabyAI research platform to support investigations towards including humans in the loop for grounded languag…

    Submitted 19 December, 2019; v1 submitted 18 October, 2018; originally announced October 2018.

    Comments: Accepted at ICLR 2019