Skip to main content

Showing 1–1 of 1 results for author: Carvalho, T H

Searching in archive cs. Search in all archives.
.
  1. arXiv:2410.12166  [pdf, other

    cs.LG cs.AI

    Reclaiming the Source of Programmatic Policies: Programmatic versus Latent Spaces

    Authors: Tales H. Carvalho, Kenneth Tjhia, Levi H. S. Lelis

    Abstract: Recent works have introduced LEAPS and HPRL, systems that learn latent spaces of domain-specific languages, which are used to define programmatic policies for partially observable Markov decision processes (POMDPs). These systems induce a latent space while optimizing losses such as the behavior loss, which aim to achieve locality in program behavior, meaning that vectors close in the latent space… ▽ More

    Submitted 15 October, 2024; originally announced October 2024.

    Comments: Published as a conference paper at ICLR 2024