Search | arXiv e-print repository

AlphaEvolve: A coding agent for scientific and algorithmic discovery

Authors: Alexander Novikov, Ngân Vũ, Marvin Eisenberger, Emilien Dupont, Po-Sen Huang, Adam Zsolt Wagner, Sergey Shirobokov, Borislav Kozlovskii, Francisco J. R. Ruiz, Abbas Mehrabian, M. Pawan Kumar, Abigail See, Swarat Chaudhuri, George Holland, Alex Davies, Sebastian Nowozin, Pushmeet Kohli, Matej Balog

Abstract: In this white paper, we present AlphaEvolve, an evolutionary coding agent that substantially enhances capabilities of state-of-the-art LLMs on highly challenging tasks such as tackling open scientific problems or optimizing critical pieces of computational infrastructure. AlphaEvolve orchestrates an autonomous pipeline of LLMs, whose task is to improve an algorithm by making direct changes to the… ▽ More In this white paper, we present AlphaEvolve, an evolutionary coding agent that substantially enhances capabilities of state-of-the-art LLMs on highly challenging tasks such as tackling open scientific problems or optimizing critical pieces of computational infrastructure. AlphaEvolve orchestrates an autonomous pipeline of LLMs, whose task is to improve an algorithm by making direct changes to the code. Using an evolutionary approach, continuously receiving feedback from one or more evaluators, AlphaEvolve iteratively improves the algorithm, potentially leading to new scientific and practical discoveries. We demonstrate the broad applicability of this approach by applying it to a number of important computational problems. When applied to optimizing critical components of large-scale computational stacks at Google, AlphaEvolve developed a more efficient scheduling algorithm for data centers, found a functionally equivalent simplification in the circuit design of hardware accelerators, and accelerated the training of the LLM underpinning AlphaEvolve itself. Furthermore, AlphaEvolve discovered novel, provably correct algorithms that surpass state-of-the-art solutions on a spectrum of problems in mathematics and computer science, significantly expanding the scope of prior automated discovery methods (Romera-Paredes et al., 2023). Notably, AlphaEvolve developed a search algorithm that found a procedure to multiply two $4 \times 4$ complex-valued matrices using $48$ scalar multiplications; offering the first improvement, after 56 years, over Strassen's algorithm in this setting. We believe AlphaEvolve and coding agents like it can have a significant impact in improving solutions of problems across many areas of science and computation. △ Less

Submitted 16 June, 2025; originally announced June 2025.

arXiv:2209.15486 [pdf, other]

Graph Neural Networks for Link Prediction with Subgraph Sketching

Authors: Benjamin Paul Chamberlain, Sergey Shirobokov, Emanuele Rossi, Fabrizio Frasca, Thomas Markovich, Nils Hammerla, Michael M. Bronstein, Max Hansmire

Abstract: Many Graph Neural Networks (GNNs) perform poorly compared to simple heuristics on Link Prediction (LP) tasks. This is due to limitations in expressive power such as the inability to count triangles (the backbone of most LP heuristics) and because they can not distinguish automorphic nodes (those having identical structural roles). Both expressiveness issues can be alleviated by learning link (rath… ▽ More Many Graph Neural Networks (GNNs) perform poorly compared to simple heuristics on Link Prediction (LP) tasks. This is due to limitations in expressive power such as the inability to count triangles (the backbone of most LP heuristics) and because they can not distinguish automorphic nodes (those having identical structural roles). Both expressiveness issues can be alleviated by learning link (rather than node) representations and incorporating structural features such as triangle counts. Since explicit link representations are often prohibitively expensive, recent works resorted to subgraph-based methods, which have achieved state-of-the-art performance for LP, but suffer from poor efficiency due to high levels of redundancy between subgraphs. We analyze the components of subgraph GNN (SGNN) methods for link prediction. Based on our analysis, we propose a novel full-graph GNN called ELPH (Efficient Link Prediction with Hashing) that passes subgraph sketches as messages to approximate the key components of SGNNs without explicit subgraph construction. ELPH is provably more expressive than Message Passing GNNs (MPNNs). It outperforms existing SGNN models on many standard LP benchmarks while being orders of magnitude faster. However, it shares the common GNN limitation that it is only efficient when the dataset fits in GPU memory. Accordingly, we develop a highly scalable model, called BUDDY, which uses feature precomputation to circumvent this limitation without sacrificing predictive performance. Our experiments show that BUDDY also outperforms SGNNs on standard LP benchmarks while being highly scalable and faster than ELPH. △ Less

Submitted 2 May, 2023; v1 submitted 30 September, 2022; originally announced September 2022.

Comments: 29 pages, 19 figures, 6 appendices

Journal ref: The Eleventh International Conference on Learning Representations 2023 (oral - top 5%)

arXiv:2011.05115 [pdf, other]

doi 10.1140/epjc/s10052-021-09224-3

Sensitivity of the SHiP experiment to dark photons decaying to a pair of charged particles

Authors: SHiP Collaboration, C. Ahdida, A. Akmete, R. Albanese, A. Alexandrov, A. Anokhina, S. Aoki, G. Arduini, E. Atkin, N. Azorskiy, J. J. Back, A. Bagulya, F. Baaltasar Dos Santos, A. Baranov, F. Bardou, G. J. Barker, M. Battistin, J. Bauche, A. Bay, V. Bayliss, G. Bencivenni, A. Y. Berdnikov, Y. A. Berdnikov, M. Bertani, C. Betancourt , et al. (309 additional authors not shown)

Abstract: Dark photons are hypothetical massive vector particles that could mix with ordinary photons. The simplest theoretical model is fully characterised by only two parameters: the mass of the dark photon m$_{γ^{\mathrm{D}}}$ and its mixing parameter with the photon, $\varepsilon$. The sensitivity of the SHiP detector is reviewed for dark photons in the mass range between 0.002 and 10 GeV. Different pro… ▽ More Dark photons are hypothetical massive vector particles that could mix with ordinary photons. The simplest theoretical model is fully characterised by only two parameters: the mass of the dark photon m$_{γ^{\mathrm{D}}}$ and its mixing parameter with the photon, $\varepsilon$. The sensitivity of the SHiP detector is reviewed for dark photons in the mass range between 0.002 and 10 GeV. Different production mechanisms are simulated, with the dark photons decaying to pairs of visible fermions, including both leptons and quarks. Exclusion contours are presented and compared with those of past experiments. The SHiP detector is expected to have a unique sensitivity for m$_{γ^{\mathrm{D}}}$ ranging between 0.8 and 3.3$^{+0.2}_{-0.5}$ GeV, and $\varepsilon^2$ ranging between $10^{-11}$ and $10^{-17}$. △ Less

Submitted 1 March, 2021; v1 submitted 10 November, 2020; originally announced November 2020.

arXiv:2002.08722 [pdf, other]

SND@LHC

Authors: SHiP Collaboration, C. Ahdida, A. Akmete, R. Albanese, A. Alexandrov, M. Andreini, A. Anokhina, S. Aoki, G. Arduini, E. Atkin, N. Azorskiy, J. J. Back, A. Bagulya, F. Baaltasar Dos Santos, A. Baranov, F. Bardou, G. J. Barker, M. Battistin, J. Bauche, A. Bay, V. Bayliss, G. Bencivenni, A. Y. Berdnikov, Y. A. Berdnikov, M. Bertani , et al. (319 additional authors not shown)

Abstract: We propose to build and operate a detector that, for the first time, will measure the process $pp\toνX$ at the LHC and search for feebly interacting particles (FIPs) in an unexplored domain. The TI18 tunnel has been identified as a suitable site to perform these measurements due to very low machine-induced background. The detector will be off-axis with respect to the ATLAS interaction point (IP1)… ▽ More We propose to build and operate a detector that, for the first time, will measure the process $pp\toνX$ at the LHC and search for feebly interacting particles (FIPs) in an unexplored domain. The TI18 tunnel has been identified as a suitable site to perform these measurements due to very low machine-induced background. The detector will be off-axis with respect to the ATLAS interaction point (IP1) and, given the pseudo-rapidity range accessible, the corresponding neutrinos will mostly come from charm decays: the proposed experiment will thus make the first test of the heavy flavour production in a pseudo-rapidity range that is not accessible by the current LHC detectors. In order to efficiently reconstruct neutrino interactions and identify their flavour, the detector will combine in the target region nuclear emulsion technology with scintillating fibre tracking layers and it will adopt a muon identification system based on scintillating bars that will also play the role of a hadronic calorimeter. The time of flight measurement will be achieved thanks to a dedicated timing detector. The detector will be a small-scale prototype of the scattering and neutrino detector (SND) of the SHiP experiment: the operation of this detector will provide an important test of the neutrino reconstruction in a high occupancy environment. △ Less

Submitted 20 February, 2020; originally announced February 2020.

Comments: Letter of Intent

Report number: CERN-LHCC-2020-002, LHCC-I-035

arXiv:2002.04632 [pdf, other]

Black-Box Optimization with Local Generative Surrogates

Authors: Sergey Shirobokov, Vladislav Belavin, Michael Kagan, Andrey Ustyuzhanin, Atılım Güneş Baydin

Abstract: We propose a novel method for gradient-based optimization of black-box simulators using differentiable local surrogate models. In fields such as physics and engineering, many processes are modeled with non-differentiable simulators with intractable likelihoods. Optimization of these forward models is particularly challenging, especially when the simulator is stochastic. To address such cases, we i… ▽ More We propose a novel method for gradient-based optimization of black-box simulators using differentiable local surrogate models. In fields such as physics and engineering, many processes are modeled with non-differentiable simulators with intractable likelihoods. Optimization of these forward models is particularly challenging, especially when the simulator is stochastic. To address such cases, we introduce the use of deep generative models to iteratively approximate the simulator in local neighborhoods of the parameter space. We demonstrate that these local surrogates can be used to approximate the gradient of the simulator, and thus enable gradient-based optimization of simulator parameters. In cases where the dependence of the simulator on the parameter space is constrained to a low dimensional submanifold, we observe that our method attains minima faster than baseline methods, including Bayesian optimization, numerical optimization, and approaches using score function gradient estimators. △ Less

Submitted 15 June, 2020; v1 submitted 11 February, 2020; originally announced February 2020.

Journal ref: In Advances in Neural Information Processing Systems 34 (NeurIPS), 2020

Showing 1–5 of 5 results for author: Shirobokov, S