AIDE: AI-Driven Exploration in the Space of Code

Jiang, Zhengyao; Schmidt, Dominik; Srikanth, Dhruv; Xu, Dixing; Kaplan, Ian; Jacenko, Deniss; Wu, Yuxiang

Computer Science > Artificial Intelligence

arXiv:2502.13138 (cs)

[Submitted on 18 Feb 2025]

Title:AIDE: AI-Driven Exploration in the Space of Code

Authors:Zhengyao Jiang, Dominik Schmidt, Dhruv Srikanth, Dixing Xu, Ian Kaplan, Deniss Jacenko, Yuxiang Wu

View PDF HTML (experimental)

Abstract:Machine learning, the foundation of modern artificial intelligence, has driven innovations that have fundamentally transformed the world. Yet, behind advancements lies a complex and often tedious process requiring labor and compute intensive iteration and experimentation. Engineers and scientists developing machine learning models spend much of their time on trial-and-error tasks instead of conceptualizing innovative solutions or research hypotheses. To address this challenge, we introduce AI-Driven Exploration (AIDE), a machine learning engineering agent powered by large language models (LLMs). AIDE frames machine learning engineering as a code optimization problem, and formulates trial-and-error as a tree search in the space of potential solutions. By strategically reusing and refining promising solutions, AIDE effectively trades computational resources for enhanced performance, achieving state-of-the-art results on multiple machine learning engineering benchmarks, including our Kaggle evaluations, OpenAI MLE-Bench and METRs RE-Bench.

Subjects:	Artificial Intelligence (cs.AI); Machine Learning (cs.LG)
Cite as:	arXiv:2502.13138 [cs.AI]
	(or arXiv:2502.13138v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2502.13138

Submission history

From: Yuxiang Wu [view email]
[v1] Tue, 18 Feb 2025 18:57:21 UTC (353 KB)

Computer Science > Artificial Intelligence

Title:AIDE: AI-Driven Exploration in the Space of Code

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Artificial Intelligence

Title:AIDE: AI-Driven Exploration in the Space of Code

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators