Skip to main content

Showing 1–2 of 2 results for author: Pyatko, D

.
  1. arXiv:2405.00662  [pdf, other

    cs.LG

    No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO

    Authors: Skander Moalla, Andrea Miele, Daniil Pyatko, Razvan Pascanu, Caglar Gulcehre

    Abstract: Reinforcement learning (RL) is inherently rife with non-stationarity since the states and rewards the agent observes during training depend on its changing policy. Therefore, networks in deep RL must be capable of adapting to new observations and fitting new targets. However, previous works have observed that networks trained under non-stationarity exhibit an inability to continue learning, termed… ▽ More

    Submitted 20 November, 2024; v1 submitted 1 May, 2024; originally announced May 2024.

    Comments: NeurIPS2024 version. Code and run histories are available at https://github.com/CLAIRE-Labo/no-representation-no-trust

  2. arXiv:2112.13822  [pdf, other

    math-ph math.CO math.DS

    Asymptotics of the number of possible endpoints of a random walk on a directed Hamiltonian metric graph

    Authors: Daniil Pyatko, Vsevolod Chernyshev

    Abstract: In this paper, the leading term of the asymptotics of the number of possible final positions of a random walk on a directed Hamiltonian metric graph is found. Consideration of such dynamical systems could be motivated by problems of propagation of narrow wave packets on metric graphs.

    Submitted 27 December, 2021; originally announced December 2021.

    MSC Class: 11N45; 37A50; 57M15