Skip to main content

Showing 1–9 of 9 results for author: Pilly, P K

Searching in archive cs. Search in all archives.
.
  1. arXiv:2411.13537  [pdf, other

    cs.LG cs.AI

    Metacognition for Unknown Situations and Environments (MUSE)

    Authors: Rodolfo Valiente, Praveen K. Pilly

    Abstract: Metacognition--the awareness and regulation of one's cognitive processes--is central to human adaptability in unknown situations. In contrast, current autonomous agents often struggle in novel environments due to their limited capacity for adaptation. We hypothesize that metacognition is a critical missing ingredient in adaptive autonomous systems, equipping them with the cognitive flexibility nee… ▽ More

    Submitted 20 November, 2024; originally announced November 2024.

  2. arXiv:2302.10887  [pdf, other

    cs.LG cs.AI

    The configurable tree graph (CT-graph): measurable problems in partially observable and distal reward environments for lifelong reinforcement learning

    Authors: Andrea Soltoggio, Eseoghene Ben-Iwhiwhu, Christos Peridis, Pawel Ladosz, Jeffery Dick, Praveen K. Pilly, Soheil Kolouri

    Abstract: This paper introduces a set of formally defined and transparent problems for reinforcement learning algorithms with the following characteristics: (1) variable degrees of observability (non-Markov observations), (2) distal and sparse rewards, (3) variable and hierarchical reward structure, (4) multiple-task generation, (5) variable problem complexity. The environment provides 1D or 2D categorical… ▽ More

    Submitted 21 January, 2023; originally announced February 2023.

  3. A Domain-Agnostic Approach for Characterization of Lifelong Learning Systems

    Authors: Megan M. Baker, Alexander New, Mario Aguilar-Simon, Ziad Al-Halah, Sébastien M. R. Arnold, Ese Ben-Iwhiwhu, Andrew P. Brna, Ethan Brooks, Ryan C. Brown, Zachary Daniels, Anurag Daram, Fabien Delattre, Ryan Dellana, Eric Eaton, Haotian Fu, Kristen Grauman, Jesse Hostetler, Shariq Iqbal, Cassandra Kent, Nicholas Ketz, Soheil Kolouri, George Konidaris, Dhireesha Kudithipudi, Erik Learned-Miller, Seungwon Lee , et al. (22 additional authors not shown)

    Abstract: Despite the advancement of machine learning techniques in recent years, state-of-the-art systems lack robustness to "real world" events, where the input distributions and tasks encountered by the deployed systems will not be limited to the original training context, and systems will instead need to adapt to novel distributions and tasks while deployed. This critical gap may be addressed through th… ▽ More

    Submitted 18 January, 2023; originally announced January 2023.

    Comments: To appear in Neural Networks

  4. arXiv:2212.11110  [pdf, other

    cs.LG cs.AI stat.ML

    Lifelong Reinforcement Learning with Modulating Masks

    Authors: Eseoghene Ben-Iwhiwhu, Saptarshi Nath, Praveen K. Pilly, Soheil Kolouri, Andrea Soltoggio

    Abstract: Lifelong learning aims to create AI systems that continuously and incrementally learn during a lifetime, similar to biological learning. Attempts so far have met problems, including catastrophic forgetting, interference among tasks, and the inability to exploit previous knowledge. While considerable research has focused on learning multiple supervised classification tasks that involve changes in t… ▽ More

    Submitted 1 August, 2023; v1 submitted 21 December, 2022; originally announced December 2022.

    Comments: Code available at https://github.com/dlpbc/mask-lrl

    Journal ref: Transactions on Machine Learning Research (2023)

  5. arXiv:2209.03207  [pdf, other

    cs.LG

    Concept-modulated model-based offline reinforcement learning for rapid generalization

    Authors: Nicholas A. Ketz, Praveen K. Pilly

    Abstract: The robustness of any machine learning solution is fundamentally bound by the data it was trained on. One way to generalize beyond the original training is through human-informed augmentation of the original dataset; however, it is impossible to specify all possible failure cases that can occur during deployment. To address this limitation we combine model-based reinforcement learning and model-in… ▽ More

    Submitted 7 September, 2022; originally announced September 2022.

  6. Context Meta-Reinforcement Learning via Neuromodulation

    Authors: Eseoghene Ben-Iwhiwhu, Jeffery Dick, Nicholas A. Ketz, Praveen K. Pilly, Andrea Soltoggio

    Abstract: Meta-reinforcement learning (meta-RL) algorithms enable agents to adapt quickly to tasks from few samples in dynamic environments. Such a feat is achieved through dynamic representations in an agent's policy network (obtained via reasoning about task context, model parameter updates, or both). However, obtaining rich dynamic representations for fast adaptation beyond simple benchmark problems is c… ▽ More

    Submitted 26 April, 2022; v1 submitted 29 October, 2021; originally announced November 2021.

    Journal ref: Neural Networks 2022

  7. arXiv:2104.08604  [pdf, other

    cs.LG

    Lifelong Learning with Sketched Structural Regularization

    Authors: Haoran Li, Aditya Krishnan, Jingfeng Wu, Soheil Kolouri, Praveen K. Pilly, Vladimir Braverman

    Abstract: Preventing catastrophic forgetting while continually learning new tasks is an essential problem in lifelong learning. Structural regularization (SR) refers to a family of algorithms that mitigate catastrophic forgetting by penalizing the network for changing its "critical parameters" from previous tasks while learning a new one. The penalty is often induced via a quadratic regularizer defined by a… ▽ More

    Submitted 17 April, 2021; originally announced April 2021.

  8. arXiv:1903.04566  [pdf, other

    cs.LG stat.ML

    Complementary Learning for Overcoming Catastrophic Forgetting Using Experience Replay

    Authors: Mohammad Rostami, Soheil Kolouri, Praveen K. Pilly

    Abstract: Despite huge success, deep networks are unable to learn effectively in sequential multitask learning settings as they forget the past learned tasks after learning new tasks. Inspired from complementary learning systems theory, we address this challenge by learning a generative model that couples the current task to the past learned tasks through a discriminative embedding space. We learn an abstra… ▽ More

    Submitted 31 May, 2019; v1 submitted 11 March, 2019; originally announced March 2019.

  9. arXiv:1903.00068  [pdf, other

    cs.NE cs.CV cs.LG

    Neuromodulated Goal-Driven Perception in Uncertain Domains

    Authors: Xinyun Zou, Soheil Kolouri, Praveen K. Pilly, Jeffrey L. Krichmar

    Abstract: In uncertain domains, the goals are often unknown and need to be predicted by the organism or system. In this paper, contrastive excitation backprop (c-EB) was used in a goal-driven perception task with pairs of noisy MNIST digits, where the system had to increase attention to one of the two digits corresponding to a goal (i.e., even, odd, low value, or high value) and decrease attention to the di… ▽ More

    Submitted 16 February, 2019; originally announced March 2019.