Adversarially Robust Decision Transformer

Tang, Xiaohang; Marques, Afonso; Kamalaruban, Parameswaran; Bogunovic, Ilija

Computer Science > Machine Learning

arXiv:2407.18414 (cs)

[Submitted on 25 Jul 2024 (v1), last revised 1 Nov 2024 (this version, v2)]

Abstract: Decision Transformer (DT), one of the representative Reinforcement Learning via Supervised Learning (RvS) methods, has achieved strong performance in offline learning tasks by leveraging the powerful Transformer architecture for sequential decision-making. However, in adversarial environments, these methods can be non-robust, since the return depends on the strategies of both the decision-maker and the adversary. Training a probabilistic model conditioned on the observed return to predict actions can fail to generalize, as the trajectories that achieve a given return in the dataset might have done so only because the behavior adversary was suboptimal. To address this, we propose a worst-case-aware RvS algorithm, the Adversarially Robust Decision Transformer (ARDT), which learns and conditions the policy on in-sample minimax returns-to-go. ARDT aligns the target return with the worst-case return learned through minimax expectile regression, thereby enhancing robustness against powerful test-time adversaries. In experiments conducted on sequential games with full data coverage, ARDT can generate a maximin (Nash Equilibrium) strategy, the solution with the largest adversarial robustness. In large-scale sequential games and continuous adversarial RL environments with partial data coverage, ARDT demonstrates significantly superior robustness to powerful test-time adversaries and attains higher worst-case returns compared to contemporary DT methods.
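
The abstract's point about worst-case returns can be illustrated with a toy sketch of expectile regression. This is not the authors' training procedure: the function names, the 2x2 return table, and the choice of tau = 0.01 are all assumptions made for illustration. It shows only the general property that a low expectile of the observed returns approximates their in-sample minimum, so ranking actions by it favors the action with the better worst case rather than the one that scored highest against a weak adversary.

```python
import numpy as np

def expectile_loss(pred, target, tau):
    """Asymmetric squared loss |tau - 1(u < 0)| * u**2 with u = target - pred."""
    u = target - pred
    weight = np.where(u < 0, 1.0 - tau, tau)
    return weight * u ** 2

def fit_expectile(samples, tau, n_iters=200):
    """Minimize the mean expectile loss over a scalar by iteratively
    reweighted averaging (tau = 0.5 recovers the ordinary mean)."""
    samples = np.asarray(samples, dtype=float)
    m = samples.mean()
    for _ in range(n_iters):
        # Samples below the current estimate get weight (1 - tau), others tau.
        w = np.where(samples < m, 1.0 - tau, tau)
        m = np.sum(w * samples) / np.sum(w)
    return m

# Hypothetical returns for each (protagonist action, adversary action) pair.
returns = np.array([
    [5.0, 0.0],  # action 0: high return only when the adversary plays badly
    [2.0, 1.0],  # action 1: modest return, but a better worst case
])

# A small tau pushes the estimate toward the minimum over adversary actions.
worst_case = np.array([fit_expectile(row, tau=0.01) for row in returns])

naive_action = int(np.argmax(returns.mean(axis=1)))  # ranks by average return
robust_action = int(np.argmax(worst_case))           # ranks by approximate worst case
```

In this toy table the average-return ranking picks action 0 while the low-expectile ranking picks action 1, which mirrors the failure mode the abstract attributes to conditioning on returns achieved against a suboptimal adversary.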

Comments:	Accepted to NeurIPS 2024
Subjects:	Machine Learning (cs.LG); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2407.18414 [cs.LG]
	(or arXiv:2407.18414v2 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2407.18414

Submission history

From: Xiaohang Tang
[v1] Thu, 25 Jul 2024 22:12:47 UTC (1,129 KB)
[v2] Fri, 1 Nov 2024 17:47:03 UTC (1,170 KB)
