Skip to main content

Showing 1–16 of 16 results for author: Balloch, J

Searching in archive cs. Search in all archives.
.
  1. arXiv:2505.10330  [pdf, other

    cs.LG cs.AI

    Efficient Adaptation of Reinforcement Learning Agents to Sudden Environmental Change

    Authors: Jonathan Clifford Balloch

    Abstract: Real-world autonomous decision-making systems, from robots to recommendation engines, must operate in environments that change over time. While deep reinforcement learning (RL) has shown an impressive ability to learn optimal policies in stationary environments, most methods are data intensive and assume a world that does not change between training and test time. As a result, conventional RL meth… ▽ More

    Submitted 15 May, 2025; originally announced May 2025.

    Comments: PhD Dissertation, 131 pages

  2. arXiv:2407.19532  [pdf, other

    cs.AI cs.LG

    The Interpretability of Codebooks in Model-Based Reinforcement Learning is Limited

    Authors: Kenneth Eaton, Jonathan Balloch, Julia Kim, Mark Riedl

    Abstract: Interpretability of deep reinforcement learning systems could assist operators with understanding how they interact with their environment. Vector quantization methods -- also called codebook methods -- discretize a neural network's latent space that is often suggested to yield emergent interpretability. We investigate whether vector quantization in fact provides interpretability in model-based re… ▽ More

    Submitted 28 July, 2024; originally announced July 2024.

  3. arXiv:2407.00264  [pdf, other

    cs.AI cs.LG

    External Model Motivated Agents: Reinforcement Learning for Enhanced Environment Sampling

    Authors: Rishav Bhagat, Jonathan Balloch, Zhiyu Lin, Julia Kim, Mark Riedl

    Abstract: Unlike reinforcement learning (RL) agents, humans remain capable multitaskers in changing environments. In spite of only experiencing the world through their own observations and interactions, people know how to balance focusing on tasks with learning about how changes may affect their understanding of the world. This is possible by choosing to solve tasks in ways that are interesting and generall… ▽ More

    Submitted 28 June, 2024; originally announced July 2024.

  4. arXiv:2404.02235  [pdf, other

    cs.LG cs.AI

    Is Exploration All You Need? Effective Exploration Characteristics for Transfer in Reinforcement Learning

    Authors: Jonathan C. Balloch, Rishav Bhagat, Geigh Zollicoffer, Ruoran Jia, Julia Kim, Mark O. Riedl

    Abstract: In deep reinforcement learning (RL) research, there has been a concerted effort to design more efficient and productive exploration methods while solving sparse-reward problems. These exploration methods often share common principles (e.g., improving diversity) and implementation details (e.g., intrinsic reward). Prior work found that non-stationary Markov decision processes (MDPs) require explora… ▽ More

    Submitted 2 April, 2024; originally announced April 2024.

  5. arXiv:2310.08731  [pdf, other

    cs.AI cs.LG eess.SY

    Novelty Detection in Reinforcement Learning with World Models

    Authors: Geigh Zollicoffer, Kenneth Eaton, Jonathan Balloch, Julia Kim, Wei Zhou, Robert Wright, Mark O. Riedl

    Abstract: Reinforcement learning (RL) using world models has found significant recent successes. However, when a sudden change to world mechanics or properties occurs then agent performance and reliability can dramatically decline. We refer to the sudden change in visual properties or state transitions as novelties. Implementing novelty detection within generated world model frameworks is a crucial task for… ▽ More

    Submitted 28 May, 2025; v1 submitted 12 October, 2023; originally announced October 2023.

    Comments: ICML Spotlight 2025

  6. arXiv:2301.06294  [pdf, other

    cs.AI cs.LG cs.SC

    Neuro-Symbolic World Models for Adapting to Open World Novelty

    Authors: Jonathan Balloch, Zhiyu Lin, Robert Wright, Xiangyu Peng, Mustafa Hussain, Aarun Srinivas, Julia Kim, Mark O. Riedl

    Abstract: Open-world novelty--a sudden change in the mechanics or properties of an environment--is a common occurrence in the real world. Novelty adaptation is an agent's ability to improve its policy performance post-novelty. Most reinforcement learning (RL) methods assume that the world is a closed, fixed process. Consequentially, RL policies adapt inefficiently to novelties. To address this, we introduce… ▽ More

    Submitted 16 January, 2023; originally announced January 2023.

    Comments: 9 pages, 8 figures, Extended Abstract accepted for presentation at AAMAS 2023

  7. arXiv:2210.06168  [pdf, ps, other

    cs.LG cs.AI

    The Role of Exploration for Task Transfer in Reinforcement Learning

    Authors: Jonathan C Balloch, Julia Kim, and Jessica L Inman, Mark O Riedl

    Abstract: The exploration--exploitation trade-off in reinforcement learning (RL) is a well-known and much-studied problem that balances greedy action selection with novel experience, and the study of exploration methods is usually only considered in the context of learning the optimal policy for a single learning task. However, in the context of online task transfer, where there is a change to the task duri… ▽ More

    Submitted 10 October, 2022; originally announced October 2022.

    Comments: 4 pages

  8. arXiv:2203.12117  [pdf, other

    cs.AI cs.LG

    NovGrid: A Flexible Grid World for Evaluating Agent Response to Novelty

    Authors: Jonathan Balloch, Zhiyu Lin, Mustafa Hussain, Aarun Srinivas, Robert Wright, Xiangyu Peng, Julia Kim, Mark Riedl

    Abstract: A robust body of reinforcement learning techniques have been developed to solve complex sequential decision making problems. However, these methods assume that train and evaluation tasks come from similarly or identically distributed environments. This assumption does not hold in real life where small novel changes to the environment can make a previously learned policy fail or introduce simpler s… ▽ More

    Submitted 22 March, 2022; originally announced March 2022.

    Comments: 7 pages, 4 figures, AAAI Spring Symposium 2022 on Designing Artificial Intelligence for Open Worlds (Long Oral)

  9. arXiv:2112.03808  [pdf, other

    cs.CL

    Automated Story Generation as Question-Answering

    Authors: Louis Castricato, Spencer Frazier, Jonathan Balloch, Nitya Tarakad, Mark Riedl

    Abstract: Neural language model-based approaches to automated story generation suffer from two important limitations. First, language model-based story generators generally do not work toward a given goal or ending. Second, they often lose coherence as the story gets longer. We propose a novel approach to automated story generation that treats the problem as one of generative question-answering. Our propose… ▽ More

    Submitted 7 December, 2021; originally announced December 2021.

  10. arXiv:2106.09701  [pdf, other

    cs.CV cs.LG

    Always Be Dreaming: A New Approach for Data-Free Class-Incremental Learning

    Authors: James Smith, Yen-Chang Hsu, Jonathan Balloch, Yilin Shen, Hongxia Jin, Zsolt Kira

    Abstract: Modern computer vision applications suffer from catastrophic forgetting when incrementally learning new concepts over time. The most successful approaches to alleviate this forgetting require extensive replay of previously seen data, which is problematic when memory constraints or data legality concerns exist. In this work, we consider the high-impact problem of Data-Free Class-Incremental Learnin… ▽ More

    Submitted 19 August, 2021; v1 submitted 17 June, 2021; originally announced June 2021.

    Comments: Accepted by the 2021 International Conference on Computer Vision (ICCV 2021)

  11. arXiv:2106.02204  [pdf, other

    cs.AI

    Detecting and Adapting to Novelty in Games

    Authors: Xiangyu Peng, Jonathan C. Balloch, Mark O. Riedl

    Abstract: Open-world novelty occurs when the rules of an environment can change abruptly, such as when a game player encounters "house rules". To address open-world novelty, game playing agents must be able to detect when novelty is injected, and to quickly adapt to the new rules. We propose a model-based reinforcement learning approach where game state and rules are represented as knowledge graphs. The kno… ▽ More

    Submitted 3 June, 2021; originally announced June 2021.

    Comments: 10 pages, 5 figures, Accepted to the AAAI21 Workshop on on Reinforcement Learning in Games

  12. arXiv:2104.07472  [pdf, other

    cs.CL

    Fabula Entropy Indexing: Objective Measures of Story Coherence

    Authors: Louis Castricato, Spencer Frazier, Jonathan Balloch, Mark Riedl

    Abstract: Automated story generation remains a difficult area of research because it lacks strong objective measures. Generated stories may be linguistically sound, but in many cases suffer poor narrative coherence required for a compelling, logically-sound story. To address this, we present Fabula Entropy Indexing (FEI), an evaluation method to assess story coherence by measuring the degree to which human… ▽ More

    Submitted 26 October, 2021; v1 submitted 22 March, 2021; originally announced April 2021.

  13. arXiv:2101.09536  [pdf, other

    cs.CV cs.LG

    Memory-Efficient Semi-Supervised Continual Learning: The World is its Own Replay Buffer

    Authors: James Smith, Jonathan Balloch, Yen-Chang Hsu, Zsolt Kira

    Abstract: Rehearsal is a critical component for class-incremental continual learning, yet it requires a substantial memory budget. Our work investigates whether we can significantly reduce this memory budget by leveraging unlabeled data from an agent's environment in a realistic and challenging continual learning paradigm. Specifically, we explore and formalize a novel semi-supervised continual learning (SS… ▽ More

    Submitted 6 May, 2021; v1 submitted 23 January, 2021; originally announced January 2021.

    Comments: Accepted by the 2021 International Joint Conference on Neural Networks (IJCNN 2021)

  14. arXiv:2001.10386  [pdf, other

    cs.RO

    Taking Recoveries to Task: Recovery-Driven Development for Recipe-based Robot Tasks

    Authors: Siddhartha Banerjee, Angel Daruna, David Kent, Weiyu Liu, Jonathan Balloch, Abhinav Jain, Akshay Krishnan, Muhammad Asif Rana, Harish Ravichandar, Binit Shah, Nithin Shrivatsav, Sonia Chernova

    Abstract: Robot task execution when situated in real-world environments is fragile. As such, robot architectures must rely on robust error recovery, adding non-trivial complexity to highly-complex robot systems. To handle this complexity in development, we introduce Recovery-Driven Development (RDD), an iterative task scripting process that facilitates rapid task and recovery development by leveraging hiera… ▽ More

    Submitted 28 January, 2020; originally announced January 2020.

    Comments: Published and presented at International Symposium on Robotics Research (ISRR), 2019 in Hanoi, Vietnam

  15. Tool Macgyvering: Tool Construction Using Geometric Reasoning

    Authors: Lakshmi Nair, Jonathan Balloch, Sonia Chernova

    Abstract: MacGyvering is defined as creating or repairing something in an inventive or improvised way by utilizing objects that are available at hand. In this paper, we explore a subset of Macgyvering problems involving tool construction, i.e., creating tools from parts available in the environment. We formalize the overall problem domain of tool Macgyvering, introducing three levels of complexity for tool… ▽ More

    Submitted 10 February, 2019; originally announced February 2019.

    Comments: Video demonstration available at: https://www.youtube.com/channel/UCxnm8iu1TS75YNXcAiI-nEw Conference: Accepted to International Conference on Robotics and Automation 2019

    Journal ref: International Conference on Robotics and Automation (ICRA) 2019

  16. arXiv:1809.03676  [pdf, other

    cs.CV cs.RO

    Unbiasing Semantic Segmentation For Robot Perception using Synthetic Data Feature Transfer

    Authors: Jonathan C Balloch, Varun Agrawal, Irfan Essa, Sonia Chernova

    Abstract: Robot perception systems need to perform reliable image segmentation in real-time on noisy, raw perception data. State-of-the-art segmentation approaches use large CNN models and carefully constructed datasets; however, these models focus on accuracy at the cost of real-time inference. Furthermore, the standard semantic segmentation datasets are not large enough for training CNNs without augmentat… ▽ More

    Submitted 11 September, 2018; originally announced September 2018.