Skip to main content

Showing 1–15 of 15 results for author: Grosnit, A

.
  1. arXiv:2501.01544  [pdf, ps, other

    cs.LG cs.CL stat.ML

    Many of Your DPOs are Secretly One: Attempting Unification Through Mutual Information

    Authors: Rasul Tutnov, Antoine Grosnit, Haitham Bou-Ammar

    Abstract: Post-alignment of large language models (LLMs) is critical in improving their utility, safety, and alignment with human intentions. Direct preference optimisation (DPO) has become one of the most widely used algorithms for achieving this alignment, given its ability to optimise models based on human feedback directly. However, the vast number of DPO variants in the literature has made it increasin… ▽ More

    Submitted 2 January, 2025; originally announced January 2025.

  2. arXiv:2411.03562  [pdf, other

    cs.LG cs.AI

    Large Language Models Orchestrating Structured Reasoning Achieve Kaggle Grandmaster Level

    Authors: Antoine Grosnit, Alexandre Maraval, James Doran, Giuseppe Paolo, Albert Thomas, Refinath Shahul Hameed Nabeezath Beevi, Jonas Gonzalez, Khyati Khandelwal, Ignacio Iacobacci, Abdelhakim Benechehab, Hamza Cherkaoui, Youssef Attia El-Hili, Kun Shao, Jianye Hao, Jun Yao, Balazs Kegl, Haitham Bou-Ammar, Jun Wang

    Abstract: We introduce Agent K v1.0, an end-to-end autonomous data science agent designed to automate, optimise, and generalise across diverse data science tasks. Fully automated, Agent K v1.0 manages the entire data science life cycle by learning from experience. It leverages a highly flexible structured reasoning framework to enable it to dynamically process memory in a nested structure, effectively learn… ▽ More

    Submitted 5 November, 2024; originally announced November 2024.

  3. arXiv:2408.09858  [pdf, ps, other

    cs.LG cs.AR

    ShortCircuit: AlphaZero-Driven Circuit Design

    Authors: Dimitrios Tsaras, Antoine Grosnit, Lei Chen, Zhiyao Xie, Haitham Bou-Ammar, Mingxuan Yuan

    Abstract: Chip design relies heavily on generating Boolean circuits, such as AND-Inverter Graphs (AIGs), from functional descriptions like truth tables. This generation operation is a key process in logic synthesis, a primary chip design stage. While recent advances in deep learning have aimed to accelerate circuit design, these efforts have mostly focused on tasks other than synthesis, and traditional heur… ▽ More

    Submitted 2 October, 2024; v1 submitted 19 August, 2024; originally announced August 2024.

  4. arXiv:2406.19741  [pdf, other

    cs.RO cs.AI

    ROS-LLM: A ROS framework for embodied AI with task feedback and structured reasoning

    Authors: Christopher E. Mower, Yuhui Wan, Hongzhan Yu, Antoine Grosnit, Jonas Gonzalez-Billandon, Matthieu Zimmer, Jinlong Wang, Xinyu Zhang, Yao Zhao, Anbang Zhai, Puze Liu, Daniel Palenicek, Davide Tateo, Cesar Cadena, Marco Hutter, Jan Peters, Guangjian Tian, Yuzheng Zhuang, Kun Shao, Xingyue Quan, Jianye Hao, Jun Wang, Haitham Bou-Ammar

    Abstract: We present a framework for intuitive robot programming by non-experts, leveraging natural language prompts and contextual information from the Robot Operating System (ROS). Our system integrates large language models (LLMs), enabling non-experts to articulate task requirements to the system through a chat interface. Key features of the framework include: integration of ROS with an AI agent connect… ▽ More

    Submitted 12 July, 2024; v1 submitted 28 June, 2024; originally announced June 2024.

    Comments: This document contains 26 pages and 13 figures

  5. arXiv:2310.13571  [pdf, ps, other

    cs.CL

    Why Can Large Language Models Generate Correct Chain-of-Thoughts?

    Authors: Rasul Tutunov, Antoine Grosnit, Juliusz Ziomek, Jun Wang, Haitham Bou-Ammar

    Abstract: This paper delves into the capabilities of large language models (LLMs), specifically focusing on advancing the theoretical comprehension of chain-of-thought prompting. We investigate how LLMs can be effectively induced to generate a coherent chain of thoughts. To achieve this, we introduce a two-level hierarchical graphical model tailored for natural language generation. Within this framework, we… ▽ More

    Submitted 6 June, 2024; v1 submitted 20 October, 2023; originally announced October 2023.

  6. arXiv:2306.09803  [pdf, other

    cs.LG

    Framework and Benchmarks for Combinatorial and Mixed-variable Bayesian Optimization

    Authors: Kamil Dreczkowski, Antoine Grosnit, Haitham Bou Ammar

    Abstract: This paper introduces a modular framework for Mixed-variable and Combinatorial Bayesian Optimization (MCBO) to address the lack of systematic benchmarking and standardized evaluation in the field. Current MCBO papers often introduce non-diverse or non-standard benchmarks to evaluate their methods, impeding the proper assessment of different MCBO primitives and their combinations. Additionally, pap… ▽ More

    Submitted 11 December, 2023; v1 submitted 16 June, 2023; originally announced June 2023.

  7. arXiv:2305.15930  [pdf, other

    cs.LG

    End-to-End Meta-Bayesian Optimisation with Transformer Neural Processes

    Authors: Alexandre Maraval, Matthieu Zimmer, Antoine Grosnit, Haitham Bou Ammar

    Abstract: Meta-Bayesian optimisation (meta-BO) aims to improve the sample efficiency of Bayesian optimisation by leveraging data from related tasks. While previous methods successfully meta-learn either a surrogate model or an acquisition function independently, joint training of both components remains an open challenge. This paper proposes the first end-to-end differentiable meta-BO framework that general… ▽ More

    Submitted 22 December, 2023; v1 submitted 25 May, 2023; originally announced May 2023.

  8. arXiv:2301.12412  [pdf, other

    cs.LG

    Contextual Causal Bayesian Optimisation

    Authors: Vahan Arsenyan, Antoine Grosnit, Haitham Bou-Ammar

    Abstract: Causal Bayesian optimisation (CaBO) combines causality with Bayesian optimisation (BO) and shows that there are situations where the optimal reward is not achievable if causal knowledge is ignored. While CaBO exploits causal relations to determine the set of controllable variables to intervene on, it does not exploit purely observational variables and marginalises them. We show that, in general, u… ▽ More

    Submitted 29 January, 2025; v1 submitted 29 January, 2023; originally announced January 2023.

    Comments: 8 pages (not counting references and appendix), 4 figures, 3 graphs

  9. arXiv:2205.13902  [pdf, other

    cs.LG

    Sample-Efficient Optimisation with Probabilistic Transformer Surrogates

    Authors: Alexandre Maraval, Matthieu Zimmer, Antoine Grosnit, Rasul Tutunov, Jun Wang, Haitham Bou Ammar

    Abstract: Faced with problems of increasing complexity, recent research in Bayesian Optimisation (BO) has focused on adapting deep probabilistic models as flexible alternatives to Gaussian Processes (GPs). In a similar vein, this paper investigates the feasibility of employing state-of-the-art probabilistic transformers in BO. Upon further investigation, we observe two drawbacks stemming from their training… ▽ More

    Submitted 30 May, 2022; v1 submitted 27 May, 2022; originally announced May 2022.

  10. arXiv:2201.12570  [pdf, other

    q-bio.BM cs.AI cs.LG cs.NE stat.ML

    AntBO: Towards Real-World Automated Antibody Design with Combinatorial Bayesian Optimisation

    Authors: Asif Khan, Alexander I. Cowen-Rivers, Antoine Grosnit, Derrick-Goh-Xin Deik, Philippe A. Robert, Victor Greiff, Eva Smorodina, Puneet Rawat, Kamil Dreczkowski, Rahmad Akbar, Rasul Tutunov, Dany Bou-Ammar, Jun Wang, Amos Storkey, Haitham Bou-Ammar

    Abstract: Antibodies are canonically Y-shaped multimeric proteins capable of highly specific molecular recognition. The CDRH3 region located at the tip of variable chains of an antibody dominates antigen-binding specificity. Therefore, it is a priority to design optimal antigen-specific CDRH3 regions to develop therapeutic antibodies. However, the combinatorial nature of CDRH3 sequence space makes it imposs… ▽ More

    Submitted 14 October, 2022; v1 submitted 29 January, 2022; originally announced January 2022.

  11. arXiv:2111.06178  [pdf, other

    cs.LG stat.ML

    BOiLS: Bayesian Optimisation for Logic Synthesis

    Authors: Antoine Grosnit, Cedric Malherbe, Rasul Tutunov, Xingchen Wan, Jun Wang, Haitham Bou Ammar

    Abstract: Optimising the quality-of-results (QoR) of circuits during logic synthesis is a formidable challenge necessitating the exploration of exponentially sized search spaces. While expert-designed operations aid in uncovering effective sequences, the increase in complexity of logic circuits favours automated procedures. Inspired by the successes of machine learning, researchers adapted deep learning and… ▽ More

    Submitted 11 November, 2021; originally announced November 2021.

  12. arXiv:2106.03609  [pdf, other

    cs.LG

    High-Dimensional Bayesian Optimisation with Variational Autoencoders and Deep Metric Learning

    Authors: Antoine Grosnit, Rasul Tutunov, Alexandre Max Maraval, Ryan-Rhys Griffiths, Alexander I. Cowen-Rivers, Lin Yang, Lin Zhu, Wenlong Lyu, Zhitang Chen, Jun Wang, Jan Peters, Haitham Bou-Ammar

    Abstract: We introduce a method combining variational autoencoders (VAEs) and deep metric learning to perform Bayesian optimisation (BO) over high-dimensional and structured input spaces. By adapting ideas from deep metric learning, we use label guidance from the blackbox function to structure the VAE latent space, facilitating the Gaussian process fit and yielding improved BO performance. Importantly for B… ▽ More

    Submitted 1 November, 2021; v1 submitted 7 June, 2021; originally announced June 2021.

  13. arXiv:2102.09745  [pdf, other

    cs.LG

    Decentralized Deterministic Multi-Agent Reinforcement Learning

    Authors: Antoine Grosnit, Desmond Cai, Laura Wynter

    Abstract: [Zhang, ICML 2018] provided the first decentralized actor-critic algorithm for multi-agent reinforcement learning (MARL) that offers convergence guarantees. In that work, policies are stochastic and are defined on finite action spaces. We extend those results to offer a provably-convergent decentralized actor-critic algorithm for learning deterministic policies on continuous action spaces. Determi… ▽ More

    Submitted 19 February, 2021; originally announced February 2021.

  14. arXiv:2012.08240  [pdf, other

    cs.LG stat.ML

    Are we Forgetting about Compositional Optimisers in Bayesian Optimisation?

    Authors: Antoine Grosnit, Alexander I. Cowen-Rivers, Rasul Tutunov, Ryan-Rhys Griffiths, Jun Wang, Haitham Bou-Ammar

    Abstract: Bayesian optimisation presents a sample-efficient methodology for global optimisation. Within this framework, a crucial performance-determining subroutine is the maximisation of the acquisition function, a task complicated by the fact that acquisition functions tend to be non-convex and thus nontrivial to optimise. In this paper, we undertake a comprehensive empirical study of approaches to maximi… ▽ More

    Submitted 17 December, 2020; v1 submitted 15 December, 2020; originally announced December 2020.

  15. arXiv:2012.03826  [pdf, other

    cs.LG math.OC

    HEBO Pushing The Limits of Sample-Efficient Hyperparameter Optimisation

    Authors: Alexander I. Cowen-Rivers, Wenlong Lyu, Rasul Tutunov, Zhi Wang, Antoine Grosnit, Ryan Rhys Griffiths, Alexandre Max Maraval, Hao Jianye, Jun Wang, Jan Peters, Haitham Bou Ammar

    Abstract: In this work we rigorously analyse assumptions inherent to black-box optimisation hyper-parameter tuning tasks. Our results on the Bayesmark benchmark indicate that heteroscedasticity and non-stationarity pose significant challenges for black-box optimisers. Based on these findings, we propose a Heteroscedastic and Evolutionary Bayesian Optimisation solver (HEBO). HEBO performs non-linear input an… ▽ More

    Submitted 25 May, 2022; v1 submitted 7 December, 2020; originally announced December 2020.

    Comments: Accepted at JAIR