Skip to main content

Showing 1–12 of 12 results for author: Terekhov, M

.
  1. arXiv:2506.05296  [pdf, ps, other

    cs.AI cs.LG

    Control Tax: The Price of Keeping AI in Check

    Authors: Mikhail Terekhov, Zhen Ning David Liu, Caglar Gulcehre, Samuel Albanie

    Abstract: The rapid integration of agentic AI into high-stakes real-world applications requires robust oversight mechanisms. The emerging field of AI Control (AIC) aims to provide such an oversight mechanism, but practical adoption depends heavily on implementation overhead. To study this problem better, we introduce the notion of Control tax -- the operational and financial cost of integrating control meas… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

  2. arXiv:2503.11703  [pdf, ps, other

    cs.LG

    Physical knowledge improves prediction of EM Fields

    Authors: Andrzej Dulny, Farzad Jabbarigargari, Andreas Hotho, Laura Maria Schreiber, Maxim Terekhov, Anna Krause

    Abstract: We propose a 3D U-Net model to predict the spatial distribution of electromagnetic fields inside a radio-frequency (RF) coil with a subject present, using the phase, amplitude, and position of the coils, along with the density, permittivity, and conductivity of the surrounding medium as inputs. To improve accuracy, we introduce a physics-augmented variant, U-Net Phys, which incorporates Gauss's la… ▽ More

    Submitted 12 March, 2025; originally announced March 2025.

  3. arXiv:2502.02619  [pdf, other

    q-fin.PM cs.LG q-fin.RM

    Regret-Optimized Portfolio Enhancement through Deep Reinforcement Learning and Future Looking Rewards

    Authors: Daniil Karzanov, Rubén Garzón, Mikhail Terekhov, Caglar Gulcehre, Thomas Raffinot, Marcin Detyniecki

    Abstract: This paper introduces a novel agent-based approach for enhancing existing portfolio strategies using Proximal Policy Optimization (PPO). Rather than focusing solely on traditional portfolio construction, our approach aims to improve an already high-performing strategy through dynamic rebalancing driven by PPO and Oracle agents. Our target is to enhance the traditional 60/40 benchmark (60% stocks,… ▽ More

    Submitted 4 February, 2025; originally announced February 2025.

    Comments: 11 pages, 7 figures

  4. arXiv:2501.06258  [pdf, other

    cs.LG

    Contextual Bandit Optimization with Pre-Trained Neural Networks

    Authors: Mikhail Terekhov

    Abstract: Bandit optimization is a difficult problem, especially if the reward model is high-dimensional. When rewards are modeled by neural networks, sublinear regret has only been shown under strong assumptions, usually when the network is extremely wide. In this thesis, we investigate how pre-training can help us in the regime of smaller models. We consider a stochastic contextual bandit with the rewards… ▽ More

    Submitted 9 January, 2025; originally announced January 2025.

    Comments: Master's thesis

  5. arXiv:2410.22366  [pdf, ps, other

    cs.LG cs.AI cs.CV

    One-Step is Enough: Sparse Autoencoders for Text-to-Image Diffusion Models

    Authors: Viacheslav Surkov, Chris Wendler, Antonio Mari, Mikhail Terekhov, Justin Deschenaux, Robert West, Caglar Gulcehre, David Bau

    Abstract: For large language models (LLMs), sparse autoencoders (SAEs) have been shown to decompose intermediate representations that often are not interpretable directly into sparse sums of interpretable features, facilitating better control and subsequent analysis. However, similar analyses and approaches have been lacking for text-to-image models. We investigate the possibility of using SAEs to learn int… ▽ More

    Submitted 30 May, 2025; v1 submitted 28 October, 2024; originally announced October 2024.

  6. arXiv:2407.16807  [pdf, other

    cs.LG cs.AI

    In Search for Architectures and Loss Functions in Multi-Objective Reinforcement Learning

    Authors: Mikhail Terekhov, Caglar Gulcehre

    Abstract: Multi-objective reinforcement learning (MORL) is essential for addressing the intricacies of real-world RL problems, which often require trade-offs between multiple utility functions. However, MORL is challenging due to unstable learning dynamics with deep learning-based function approximators. The research path most taken has been to explore different value-based loss functions for MORL to overco… ▽ More

    Submitted 23 July, 2024; originally announced July 2024.

    Comments: 20 pages, 10 figures, 3 tables

  7. arXiv:2406.18370  [pdf, ps, other

    quant-ph cs.AI cs.LG stat.ML

    Learning pure quantum states (almost) without regret

    Authors: Josep Lumbreras, Mikhail Terekhov, Marco Tomamichel

    Abstract: We initiate the study of sample-optimal quantum state tomography with minimal disturbance to the samples. Can we efficiently learn a precise description of a quantum state through sequential measurements of samples while at the same time making sure that the post-measurement state of the samples is only minimally perturbed? Defining regret as the cumulative disturbance of all samples, the challeng… ▽ More

    Submitted 5 June, 2025; v1 submitted 26 June, 2024; originally announced June 2024.

    Comments: 28 pages, 2 figures

  8. Invariant systems of weighted representatives

    Authors: Anton A. Klyachko, Mikhail S. Terekhov

    Abstract: It is known that, if removing some $n$ edges from a graph $Γ$ destroys all subgraphs isomorphic to a given finite graph $K$, then all subgraphs isomorphic to $K$ can be destroyed by removing at most $|E(K)|\cdot n$ edges, which form a set invariant with respect to all automorphisms of $Γ$. We construct the first examples of (connected) graphs $K$ for which this estimate is not sharp. Our arguments… ▽ More

    Submitted 19 January, 2025; v1 submitted 20 June, 2023; originally announced June 2023.

    Comments: 5 pages. A Russian version of this paper is at http://halgebra.math.msu.su/staff/klyachko/papers.htm . V2: minor corrections

    Journal ref: Journal of Algebraic Combinatorics, 61:3 (2025), 32

  9. arXiv:2202.09590  [pdf, ps, other

    math.CO

    The cost of symmetry in connected graphs

    Authors: M. S. Terekhov

    Abstract: The paper answers the question posed in a joint paper by A. A. Klyachko and N. M. Luneva about the optimality of the estimate for the cost of symmetry in graphs. The original estimate says that if n vertices can be removed from a connected graph so that there is no connected subgraph of isomorphic $Γ$ left in it, then at most $n|V(Γ)|$ vertices that form a set invariant under all automorphisms of… ▽ More

    Submitted 18 June, 2022; v1 submitted 19 February, 2022; originally announced February 2022.

    Comments: in Russian language

  10. arXiv:1609.05423  [pdf, other

    physics.comp-ph math.NA

    An adaptive numerical method for free surface flows passing rigidly mounted obstacles

    Authors: Kirill D. Nikitin, Maxim A. Olshanskii, Kirill M. Terekhov, Yuri V. Vassilevski, Ruslan Yanbarisov

    Abstract: The paper develops a method for the numerical simulation of a free-surface flow of incompressible viscous fluid around a streamlined body. The body is a rigid stationary construction partially submerged in the fluid. The application we are interested in the paper is a flow around a surface mounted offshore oil platform. The numerical method builds on a hybrid finite volume / finite difference disc… ▽ More

    Submitted 9 February, 2017; v1 submitted 18 September, 2016; originally announced September 2016.

    MSC Class: 76D27; 65M08

  11. Ultrasensitive 3He magnetometer for measurements of high magnetic fields

    Authors: A. Nikiel, P. Blümler, W. Heil, M. Hehn, S. Karpuk, A. Maul, E. Otten, L. M. Schreiber, M. Terekhov

    Abstract: We describe a 3He magnetometer capable to measure high magnetic fields (B > 0.1 Tesla) with a relative accuracy of better than 10^-12. Our approach is based on the measurement of the free induction decay of gaseous, nuclear spin polarized 3He following a resonant radio frequency pulse excitation. The measurement sensitivity can be attributed to the long coherent spin precession time T2* being of o… ▽ More

    Submitted 27 May, 2014; originally announced May 2014.

    Comments: 27 pages, 7 figures

  12. 3rd Interplanetary Network Localization, Time History, Fluence, Peak Flux, and Distance Lower Limit of the February 28, 1997 Gamma-Ray Burst

    Authors: K. Hurley, E. Costa, M. Feroci, F. Frontera, T. Cline, D. Dal Fiume, M. Orlandini, M. Boer, E. Mazets, R. Aptekar, S. Golenetskii, M. Terekhov

    Abstract: The gamma-ray burst of 1997 February 28 was localized using the arrival-time analysis method with the Ulysses, BeppoSAX, and WIND spacecraft. The result is a plus-or-minus 31.5 arcsec (3 sigma) wide annulus of possible arrival directions which intersects both the position of the burst determined independently by the SAX Wide Field Camera, and the position of a fading X-ray source detected by the… ▽ More

    Submitted 16 May, 1997; originally announced May 1997.

    Comments: 11 pages, postscript, 2 figures. Accepted for publication in ApJ Letters