Large Scale Multi-Task Bayesian Optimization with Large Language Models

Zeng, Yimeng; Maus, Natalie; Jones, Haydn Thomas; Tao, Jeffrey; Wan, Fangping; Torres, Marcelo Der Torossian; de la Fuente-Nunez, Cesar; Marcus, Ryan; Bastani, Osbert; Gardner, Jacob R.

Computer Science > Machine Learning

arXiv:2503.08131 (cs)

[Submitted on 11 Mar 2025]

Title:Large Scale Multi-Task Bayesian Optimization with Large Language Models

Authors:Yimeng Zeng, Natalie Maus, Haydn Thomas Jones, Jeffrey Tao, Fangping Wan, Marcelo Der Torossian Torres, Cesar de la Fuente-Nunez, Ryan Marcus, Osbert Bastani, Jacob R. Gardner

View PDF HTML (experimental)

Abstract:In multi-task Bayesian optimization, the goal is to leverage experience from optimizing existing tasks to improve the efficiency of optimizing new ones. While approaches using multi-task Gaussian processes or deep kernel transfer exist, the performance improvement is marginal when scaling to more than a moderate number of tasks. We introduce a novel approach leveraging large language models (LLMs) to learn from, and improve upon, previous optimization trajectories, scaling to approximately 2000 distinct tasks. Specifically, we propose an iterative framework in which an LLM is fine-tuned using the high quality solutions produced by BayesOpt to generate improved initializations that accelerate convergence for future optimization tasks based on previous search trajectories. We evaluate our method on two distinct domains: database query optimization and antimicrobial peptide design. Results demonstrate that our approach creates a positive feedback loop, where the LLM's generated initializations gradually improve, leading to better optimization performance. As this feedback loop continues, we find that the LLM is eventually able to generate solutions to new tasks in just a few shots that are better than the solutions produced by "from scratch" by Bayesian optimization while simultaneously requiring significantly fewer oracle calls.

Subjects:	Machine Learning (cs.LG)
Cite as:	arXiv:2503.08131 [cs.LG]
	(or arXiv:2503.08131v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2503.08131

Submission history

From: Yimeng Zeng [view email]
[v1] Tue, 11 Mar 2025 07:46:19 UTC (1,926 KB)

Computer Science > Machine Learning

Title:Large Scale Multi-Task Bayesian Optimization with Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Large Scale Multi-Task Bayesian Optimization with Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators