Measuring Interventional Robustness in Reinforcement Learning

Avery, Katherine; Kenney, Jack; Amaranath, Pracheta; Cai, Erica; Jensen, David

arXiv:2209.09058 (cs)

[Submitted on 19 Sep 2022]

Abstract: Recent work in reinforcement learning has focused on several characteristics of learned policies that go beyond maximizing reward. These properties include fairness, explainability, generalization, and robustness. In this paper, we define interventional robustness (IR), a measure of how much variability is introduced into learned policies by incidental aspects of the training procedure, such as the order of training data or the particular exploratory actions taken by agents. A training procedure has high IR when the agents it produces take very similar actions under intervention, despite variation in these incidental aspects of the training procedure. We develop an intuitive, quantitative measure of IR and calculate it for eight algorithms in three Atari environments across dozens of interventions and states. From these experiments, we find that IR varies with the amount of training and the type of algorithm and that, contrary to what one might expect, high performance does not imply high IR.
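The abstract does not state the formula behind the IR measure. As a rough illustration of "agents take very similar actions under intervention", the sketch below scores agreement as the fraction of independently trained agents that choose the most common action in an intervened state, averaged over states. This modal-agreement score, and the names `modal_agreement` and `mean_agreement`, are assumptions made for illustration and may differ from the paper's actual definition.

```python
from collections import Counter
from typing import Hashable, Sequence


def modal_agreement(actions: Sequence[Hashable]) -> float:
    """Fraction of agents that chose the most common action in one state.

    Hypothetical stand-in for an agreement score; not taken from the paper.
    """
    if not actions:
        raise ValueError("need at least one agent's action")
    most_common_count = Counter(actions).most_common(1)[0][1]
    return most_common_count / len(actions)


def mean_agreement(actions_by_state: Sequence[Sequence[Hashable]]) -> float:
    """Average the per-state agreement over all intervened states."""
    if not actions_by_state:
        raise ValueError("need at least one state")
    total = sum(modal_agreement(actions) for actions in actions_by_state)
    return total / len(actions_by_state)


# Each inner list holds the actions that separately trained agents
# selected in the same intervened state (made-up example data).
example = [
    ["LEFT", "LEFT", "RIGHT", "LEFT"],  # 3 of 4 agents agree
    ["FIRE", "FIRE", "FIRE", "FIRE"],   # all agents agree
]
print(mean_agreement(example))
```

Under this assumed score, 1.0 means every agent picks the same action in every intervened state, and lower values indicate that incidental training variation changed what the agents do.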

Comments:	17 pages, 13 figures
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2209.09058 [cs.LG]
	(or arXiv:2209.09058v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2209.09058

Submission history

From: Katherine Avery
[v1] Mon, 19 Sep 2022 14:50:05 UTC (5,620 KB)
