Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery

Nikonova, Ekaterina; Xue, Cheng; Renz, Jochen

Abstract:Deep reinforcement learning suffers from catastrophic forgetting and sample inefficiency making it less applicable to the ever-changing real world. However, the ability to use previously learned knowledge is essential for AI agents to quickly adapt to novelties. Often, certain spatial information observed by the agent in the previous interactions can be leveraged to infer task-specific rules. Inferred rules can then help the agent to avoid potentially dangerous situations in the previously unseen states and guide the learning process increasing agent's novelty adaptation speed. In this work, we propose a general framework that is applicable to deep reinforcement learning agents. Our framework provides the agent with an autonomous way to discover the task-specific rules in the novel environments and self-supervise it's learning. We provide a rule-driven deep Q-learning agent (RDQ) as one possible implementation of that framework. We show that RDQ successfully extracts task-specific rules as it interacts with the world and uses them to drastically increase its learning efficiency. In our experiments, we show that the RDQ agent is significantly more resilient to the novelties than the baseline agents, and is able to detect and adapt to novel situations faster.

Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:2311.14270 [cs.AI]
	(or arXiv:2311.14270v1 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.2311.14270

Computer Science > Artificial Intelligence

Title:Efficient Open-world Reinforcement Learning via Knowledge Distillation and Autonomous Rule Discovery

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators