Skip to main content

Showing 1–6 of 6 results for author: Pankov, S

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.15080  [pdf, ps, other

    cs.LG cs.AI cs.CL

    SUS backprop: linear backpropagation algorithm for long inputs in transformers

    Authors: Sergey Pankov, Georges Harik

    Abstract: It is straightforward to design an unbiased gradient estimator that stochastically cuts the backpropagation flow through any part of a computational graph. By cutting the parts that have little effect on the computation, one can potentially save a significant amount of backpropagation computation in exchange for a minimal increase in the stochastic gradient variance, in some situations. Such a sit… ▽ More

    Submitted 4 June, 2025; v1 submitted 21 May, 2025; originally announced May 2025.

    Comments: 21 pages, 9 figures; main results unchanged, Fig.5 updated, some text rearranged

  2. arXiv:2402.16010  [pdf, other

    cs.RO physics.bio-ph physics.class-ph

    Energy-conserving intermittent-contact motion in complex models

    Authors: Sergey Pankov

    Abstract: Some mechanical systems, that are modeled to have inelastic collisions, nonetheless possess energy-conserving intermittent-contact solutions, known as collisionless solutions. Such a solution, representing a persistent hopping or walking across a level ground, may be important for understanding animal locomotion or for designing efficient walking machines. So far, collisionless motion has been ana… ▽ More

    Submitted 25 February, 2024; originally announced February 2024.

    Comments: 27 pages, 6 figures

    Journal ref: Communications in Nonlinear Science and Numerical Simulation, 132 (2024), 107895

  3. arXiv:2204.02471  [pdf, other

    cs.RO cs.LG eess.SY

    Configuration Path Control

    Authors: Sergey Pankov

    Abstract: Reinforcement learning methods often produce brittle policies -- policies that perform well during training, but generalize poorly beyond their direct training experience, thus becoming unstable under small disturbances. To address this issue, we propose a method for stabilizing a control policy in the space of configuration paths. It is applied post-training and relies purely on the data produced… ▽ More

    Submitted 5 April, 2022; originally announced April 2022.

    Comments: 12 pages, 3 figures, accepted for publication

    Journal ref: Int. J. Control Autom. Syst. 21, 306-317 (2023)

  4. arXiv:2106.11765  [pdf, other

    physics.bio-ph cs.RO

    Three-dimensional bipedal model with zero-energy-cost walking

    Authors: Sergey Pankov

    Abstract: We study a three-dimensional articulated rigid-body biped model that possesses zero cost of transport walking gaits. Energy losses are avoided due to the complete elimination of the foot-ground collisions by the concerted oscillatory motion of the model's parts. The model consists of two parts connected via a universal joint. It does not rely on any geometry altering mechanisms, massless parts or… ▽ More

    Submitted 19 June, 2021; originally announced June 2021.

    Comments: 20 pages, 7 figures

    Journal ref: Phys. Rev. E 103, 043003 (2021)

  5. arXiv:1811.06225  [pdf, other

    cs.LG stat.ML

    Reward-estimation variance elimination in sequential decision processes

    Authors: Sergey Pankov

    Abstract: Policy gradient methods are very attractive in reinforcement learning due to their model-free nature and convergence guarantees. These methods, however, suffer from high variance in gradient estimation, resulting in poor sample efficiency. To mitigate this issue, a number of variance-reduction approaches have been proposed. Unfortunately, in the challenging problems with delayed rewards, these app… ▽ More

    Submitted 15 November, 2018; originally announced November 2018.

  6. Learning image transformations without training examples

    Authors: Sergey Pankov

    Abstract: The use of image transformations is essential for efficient modeling and learning of visual data. But the class of relevant transformations is large: affine transformations, projective transformations, elastic deformations, ... the list goes on. Therefore, learning these transformations, rather than hand coding them, is of great conceptual interest. To the best of our knowledge, all the related wo… ▽ More

    Submitted 30 September, 2011; originally announced October 2011.

    Comments: 15 pages, 1 figure, ISVC11

    Journal ref: Proc. 7th International Symposium on Visual Computing, part II, pp 168-179, 2011