Skip to main content

Showing 1–5 of 5 results for author: Korenkevych, D

Searching in archive stat. Search in all archives.
.
  1. arXiv:2310.09426  [pdf, other

    cs.LG stat.ML

    Offline Reinforcement Learning for Optimizing Production Bidding Policies

    Authors: Dmytro Korenkevych, Frank Cheng, Artsiom Balakir, Alex Nikulkov, Lingnan Gao, Zhihao Cen, Zuobing Xu, Zheqing Zhu

    Abstract: The online advertising market, with its thousands of auctions run per second, presents a daunting challenge for advertisers who wish to optimize their spend under a budget constraint. Thus, advertising platforms typically provide automated agents to their customers, which act on their behalf to bid for impression opportunities in real time at scale. Because these proxy agents are owned by the plat… ▽ More

    Submitted 13 October, 2023; originally announced October 2023.

  2. arXiv:1903.11524  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Autoregressive Policies for Continuous Control Deep Reinforcement Learning

    Authors: Dmytro Korenkevych, A. Rupam Mahmood, Gautham Vasan, James Bergstra

    Abstract: Reinforcement learning algorithms rely on exploration to discover new behaviors, which is typically achieved by following a stochastic policy. In continuous control tasks, policies with a Gaussian distribution have been widely adopted. Gaussian exploration however does not result in smooth trajectories that generally correspond to safe and rewarding behaviors in practical tasks. In addition, Gauss… ▽ More

    Submitted 27 March, 2019; originally announced March 2019.

    Comments: Submitted to 28th International Joint Conference on Artificial Intelligence (IJCAI 2019). Video: https://youtu.be/NCpyXBNqNmw Code: https://github.com/dkorenkevych/arp

  3. arXiv:1809.07731  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Benchmarking Reinforcement Learning Algorithms on Real-World Robots

    Authors: A. Rupam Mahmood, Dmytro Korenkevych, Gautham Vasan, William Ma, James Bergstra

    Abstract: Through many recent successes in simulation, model-free reinforcement learning has emerged as a promising approach to solving continuous control robotic tasks. The research community is now able to reproduce, analyze and build quickly on these results due to open source implementations of learning algorithms and simulated benchmark tasks. To carry forward these successes to real-world applications… ▽ More

    Submitted 20 September, 2018; originally announced September 2018.

    Comments: Appears in Proceedings of the Second Conference on Robot Learning (CoRL 2018). Companion video at https://youtu.be/ovDfhvjpQd8 and source code at https://github.com/kindredresearch/SenseAct

  4. arXiv:1803.07067  [pdf, other

    cs.LG cs.AI cs.RO stat.ML

    Setting up a Reinforcement Learning Task with a Real-World Robot

    Authors: A. Rupam Mahmood, Dmytro Korenkevych, Brent J. Komer, James Bergstra

    Abstract: Reinforcement learning is a promising approach to developing hard-to-engineer adaptive solutions for complex and diverse robotic tasks. However, learning with real-world robots is often unreliable and difficult, which resulted in their low adoption in reinforcement learning research. This difficulty is worsened by the lack of guidelines for setting up learning tasks with robots. In this work, we d… ▽ More

    Submitted 19 March, 2018; originally announced March 2018.

    Comments: Submitted to 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS)

  5. arXiv:1611.04528  [pdf, other

    quant-ph cs.LG stat.ML

    Benchmarking Quantum Hardware for Training of Fully Visible Boltzmann Machines

    Authors: Dmytro Korenkevych, Yanbo Xue, Zhengbing Bian, Fabian Chudak, William G. Macready, Jason Rolfe, Evgeny Andriyash

    Abstract: Quantum annealing (QA) is a hardware-based heuristic optimization and sampling method applicable to discrete undirected graphical models. While similar to simulated annealing, QA relies on quantum, rather than thermal, effects to explore complex search spaces. For many classes of problems, QA is known to offer computational advantages over simulated annealing. Here we report on the ability of rece… ▽ More

    Submitted 14 November, 2016; originally announced November 2016.

    Comments: 22 pages, 13 figures, D-Wave quantum system for sampling Boltzmann machines