Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search

Bogoclu, Can; Vosshall, Robert; Cremanns, Kevin; Roos, Dirk

doi:10.1109/ACDSA59508.2024.10467448

Computer Science > Machine Learning

arXiv:2403.15908 (cs)

[Submitted on 23 Mar 2024]

Title:Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search

Authors:Can Bogoclu, Robert Vosshall, Kevin Cremanns, Dirk Roos

View PDF HTML (experimental)

Abstract:Probabilistic world models increase data efficiency of model-based reinforcement learning (MBRL) by guiding the policy with their epistemic uncertainty to improve exploration and acquire new samples. Moreover, the uncertainty-aware learning procedures in probabilistic approaches lead to robust policies that are less sensitive to noisy observations compared to uncertainty unaware solutions. We propose to combine trajectory sampling and deep Gaussian covariance network (DGCN) for a data-efficient solution to MBRL problems in an optimal control setting. We compare trajectory sampling with density-based approximation for uncertainty propagation using three different probabilistic world models; Gaussian processes, Bayesian neural networks, and DGCNs. We provide empirical evidence using four different well-known test environments, that our method improves the sample-efficiency over other combinations of uncertainty propagation methods and probabilistic models. During our tests, we place particular emphasis on the robustness of the learned policies with respect to noisy initial states.

Subjects:	Machine Learning (cs.LG); Machine Learning (stat.ML)
Cite as:	arXiv:2403.15908 [cs.LG]
	(or arXiv:2403.15908v1 [cs.LG] for this version)
	https://doi.org/10.48550/arXiv.2403.15908
Related DOI:	https://doi.org/10.1109/ACDSA59508.2024.10467448

Submission history

From: Can Bogoclu [view email]
[v1] Sat, 23 Mar 2024 18:42:22 UTC (465 KB)

Computer Science > Machine Learning

Title:Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Machine Learning

Title:Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy Search

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators