Cross-Domain Perceptual Reward Functions

Edwards, Ashley D.; Sood, Srijan; Isbell Jr, Charles L.

arXiv:1705.09045 (cs)

[Submitted on 25 May 2017 (v1), last revised 25 Jul 2017 (this version, v3)]

Abstract: In reinforcement learning, we often define goals by specifying rewards within desirable states. One problem with this approach is that we typically need to redefine the rewards each time the goal changes, which often requires some understanding of the solution in the agent's environment. When humans are learning to complete tasks, we regularly utilize alternative sources that guide our understanding of the problem. Such task representations allow one to specify goals on their own terms, thus providing specifications that can be appropriately interpreted across various environments. This motivates our own work, in which we represent goals in environments that are different from the agent's. We introduce Cross-Domain Perceptual Reward (CDPR) functions, learned rewards that represent the visual similarity between an agent's state and a cross-domain goal image. We report results for learning the CDPRs with a deep neural network and using them to solve two tasks with deep reinforcement learning.

Comments:	A shorter version of this paper was accepted to RLDM
Subjects:	Artificial Intelligence (cs.AI)
Cite as:	arXiv:1705.09045 [cs.AI]
	(or arXiv:1705.09045v3 [cs.AI] for this version)
	https://doi.org/10.48550/arXiv.1705.09045

Submission history

From: Ashley Edwards
[v1] Thu, 25 May 2017 04:54:36 UTC (1,768 KB)
[v2] Wed, 7 Jun 2017 15:44:37 UTC (1,732 KB)
[v3] Tue, 25 Jul 2017 15:40:28 UTC (1,732 KB)
