Skip to main content

Showing 1–6 of 6 results for author: Aravindan, S

.
  1. arXiv:2501.09611  [pdf, other

    cs.LG

    EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning

    Authors: Siddharth Aravindan, Dixant Mittal, Wee Sun Lee

    Abstract: Posterior Sampling for Reinforcement Learning (PSRL) is a well-known algorithm that augments model-based reinforcement learning (MBRL) algorithms with Thompson sampling. PSRL maintains posterior distributions of the environment transition dynamics and the reward function, which are intractable for tasks with high-dimensional state and action spaces. Recent works show that dropout, used in conjunct… ▽ More

    Submitted 16 January, 2025; originally announced January 2025.

    Journal ref: Asian Conference on Machine Learning 2024

  2. arXiv:2501.04374  [pdf, other

    cs.CV

    Instructive3D: Editing Large Reconstruction Models with Text Instructions

    Authors: Kunal Kathare, Ankit Dhiman, K Vikas Gowda, Siddharth Aravindan, Shubham Monga, Basavaraja Shanthappa Vandrotti, Lokesh R Boregowda

    Abstract: Transformer based methods have enabled users to create, modify, and comprehend text and image data. Recently proposed Large Reconstruction Models (LRMs) further extend this by providing the ability to generate high-quality 3D models with the help of a single object image. These models, however, lack the ability to manipulate or edit the finer details, such as adding standard design patterns or cha… ▽ More

    Submitted 8 January, 2025; originally announced January 2025.

    Comments: Accepted at WACV 2025. First two authors contributed equally

  3. arXiv:2202.01461  [pdf, ps, other

    cs.AI cs.LG

    ExPoSe: Combining State-Based Exploration with Gradient-Based Online Search

    Authors: Dixant Mittal, Siddharth Aravindan, Wee Sun Lee

    Abstract: Online tree-based search algorithms iteratively simulate trajectories and update action-values for a set of states stored in a tree structure. It works reasonably well in practice but fails to effectively utilise the information gathered from similar states. Depending upon the smoothness of the action-value function, one approach to overcoming this issue is through online learning, where informati… ▽ More

    Submitted 4 March, 2023; v1 submitted 3 February, 2022; originally announced February 2022.

    Journal ref: In Proceedings of the 22nd International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2023)

  4. arXiv:2102.03719  [pdf, other

    cs.LG cs.AI

    State-Aware Variational Thompson Sampling for Deep Q-Networks

    Authors: Siddharth Aravindan, Wee Sun Lee

    Abstract: Thompson sampling is a well-known approach for balancing exploration and exploitation in reinforcement learning. It requires the posterior distribution of value-action functions to be maintained; this is generally intractable for tasks that have a high dimensional state-action space. We derive a variational Thompson sampling approximation for DQNs which uses a deep network whose parameters are per… ▽ More

    Submitted 7 February, 2021; originally announced February 2021.

  5. arXiv:2102.03718  [pdf, other

    cs.LG

    An Analysis of Frame-skipping in Reinforcement Learning

    Authors: Shivaram Kalyanakrishnan, Siddharth Aravindan, Vishwajeet Bagdawat, Varun Bhatt, Harshith Goka, Archit Gupta, Kalpesh Krishna, Vihari Piratla

    Abstract: In the practice of sequential decision making, agents are often designed to sense state at regular intervals of $d$ time steps, $d > 1$, ignoring state information in between sensing steps. While it is clear that this practice can reduce sensing and compute costs, recent results indicate a further benefit. On many Atari console games, reinforcement learning (RL) algorithms deliver substantially be… ▽ More

    Submitted 6 February, 2021; originally announced February 2021.

  6. arXiv:2007.04756  [pdf, other

    cs.AI cs.CV cs.LG cs.NE

    Learning to Prune Deep Neural Networks via Reinforcement Learning

    Authors: Manas Gupta, Siddharth Aravindan, Aleksandra Kalisz, Vijay Chandrasekhar, Lin Jie

    Abstract: This paper proposes PuRL - a deep reinforcement learning (RL) based algorithm for pruning neural networks. Unlike current RL based model compression approaches where feedback is given only at the end of each episode to the agent, PuRL provides rewards at every pruning step. This enables PuRL to achieve sparsity and accuracy comparable to current state-of-the-art methods, while having a much shorte… ▽ More

    Submitted 9 July, 2020; originally announced July 2020.

    Comments: Accepted at the ICML 2020 Workshop on Automated Machine Learning (AutoML 2020)