Skip to main content

Showing 1–3 of 3 results for author: Badrinaaraayanan, A

Searching in archive cs. Search in all archives.
.
  1. arXiv:2302.02792  [pdf, other

    cs.LG

    Dealing With Non-stationarity in Decentralized Cooperative Multi-Agent Deep Reinforcement Learning via Multi-Timescale Learning

    Authors: Hadi Nekoei, Akilesh Badrinaaraayanan, Amit Sinha, Mohammad Amini, Janarthanan Rajendran, Aditya Mahajan, Sarath Chandar

    Abstract: Decentralized cooperative multi-agent deep reinforcement learning (MARL) can be a versatile learning framework, particularly in scenarios where centralized training is either not possible or not practical. One of the critical challenges in decentralized deep MARL is the non-stationarity of the learning environment when multiple agents are learning concurrently. A commonly used and efficient scheme… ▽ More

    Submitted 17 August, 2023; v1 submitted 6 February, 2023; originally announced February 2023.

  2. arXiv:2103.03216  [pdf, other

    cs.LG cs.AI cs.MA

    Continuous Coordination As a Realistic Scenario for Lifelong Learning

    Authors: Hadi Nekoei, Akilesh Badrinaaraayanan, Aaron Courville, Sarath Chandar

    Abstract: Current deep reinforcement learning (RL) algorithms are still highly task-specific and lack the ability to generalize to new environments. Lifelong learning (LLL), however, aims at solving multiple tasks sequentially by efficiently transferring and using knowledge between tasks. Despite a surge of interest in lifelong RL in recent years, the lack of a realistic testbed makes robust evaluation of L… ▽ More

    Submitted 14 June, 2021; v1 submitted 4 March, 2021; originally announced March 2021.

    Comments: 19 pages with supplementary materials. Added results for Lifelong RL methods and some future work. Accepted to ICML 2021

  3. PatchUp: A Feature-Space Block-Level Regularization Technique for Convolutional Neural Networks

    Authors: Mojtaba Faramarzi, Mohammad Amini, Akilesh Badrinaaraayanan, Vikas Verma, Sarath Chandar

    Abstract: Large capacity deep learning models are often prone to a high generalization gap when trained with a limited amount of labeled training data. A recent class of methods to address this problem uses various ways to construct a new training sample by mixing a pair (or more) of training samples. We propose PatchUp, a hidden state block-level regularization technique for Convolutional Neural Networks (… ▽ More

    Submitted 7 January, 2023; v1 submitted 14 June, 2020; originally announced June 2020.

    Comments: AAAI - 2022

    Journal ref: AAAI, vol. 36, no. 1, pp. 589-597, Jun. 2022