Skip to main content

Showing 1–4 of 4 results for author: Kemaev, I

Searching in archive stat. Search in all archives.
.
  1. arXiv:2505.17895  [pdf, ps, other

    stat.ML cs.AI cs.LG

    DataRater: Meta-Learned Dataset Curation

    Authors: Dan A. Calian, Gregory Farquhar, Iurii Kemaev, Luisa M. Zintgraf, Matteo Hessel, Jeremy Shar, Junhyuk Oh, András György, Tom Schaul, Jeffrey Dean, Hado van Hasselt, David Silver

    Abstract: The quality of foundation models depends heavily on their training data. Consequently, great efforts have been put into dataset curation. Yet most approaches rely on manual tuning of coarse-grained mixtures of large buckets of data, or filtering by hand-crafted heuristics. An approach that is ultimately more scalable (let alone more satisfying) is to \emph{learn} which data is actually valuable fo… ▽ More

    Submitted 23 May, 2025; originally announced May 2025.

    ACM Class: I.2.6

  2. arXiv:2505.00793  [pdf, ps, other

    cs.LG cs.AI stat.ML

    Scalable Meta-Learning via Mixed-Mode Differentiation

    Authors: Iurii Kemaev, Dan A Calian, Luisa M Zintgraf, Gregory Farquhar, Hado van Hasselt

    Abstract: Gradient-based bilevel optimisation is a powerful technique with applications in hyperparameter optimisation, task adaptation, algorithm discovery, meta-learning more broadly, and beyond. It often requires differentiating through the gradient-based optimisation itself, leading to "gradient-of-a-gradient" calculations with computationally expensive second-order and mixed derivatives. While modern a… ▽ More

    Submitted 9 June, 2025; v1 submitted 1 May, 2025; originally announced May 2025.

  3. arXiv:2105.05347  [pdf, other

    cs.LG cs.AI stat.ML

    Return-based Scaling: Yet Another Normalisation Trick for Deep RL

    Authors: Tom Schaul, Georg Ostrovski, Iurii Kemaev, Diana Borsa

    Abstract: Scaling issues are mundane yet irritating for practitioners of reinforcement learning. Error scales vary across domains, tasks, and stages of learning; sometimes by many orders of magnitude. This can be detrimental to learning speed and stability, create interference between learning tasks, and necessitate substantial tuning. We revisit this topic for agents based on temporal-difference learning,… ▽ More

    Submitted 11 May, 2021; originally announced May 2021.

  4. arXiv:1811.04380  [pdf, other

    stat.ML cs.LG

    ReSet: Learning Recurrent Dynamic Routing in ResNet-like Neural Networks

    Authors: Iurii Kemaev, Daniil Polykovskiy, Dmitry Vetrov

    Abstract: Neural Network is a powerful Machine Learning tool that shows outstanding performance in Computer Vision, Natural Language Processing, and Artificial Intelligence. In particular, recently proposed ResNet architecture and its modifications produce state-of-the-art results in image classification problems. ResNet and most of the previously proposed architectures have a fixed structure and apply the… ▽ More

    Submitted 11 November, 2018; originally announced November 2018.

    Comments: Published in Proceedings of The 10th Asian Conference on Machine Learning, http://proceedings.mlr.press/v95/kemaev18a.html

    Journal ref: Proceedings of The 10th Asian Conference on Machine Learning, PMLR 95:422-437, 2018