Skip to main content

Showing 1–2 of 2 results for author: Shabadi, G

Searching in archive cs. Search in all archives.
.
  1. arXiv:2506.04632  [pdf, ps, other

    cs.LG

    Composing Agents to Minimize Worst-case Risk

    Authors: Guruprerana Shabadi, Rajeev Alur

    Abstract: From software development to robot control, modern agentic systems decompose complex objectives into a sequence of subtasks and choose a set of specialized AI agents to complete them. We formalize an agentic workflow as a directed acyclic graph, called an agent graph, where edges represent AI agents and paths correspond to feasible compositions of agents. When deploying these systems in the real w… ▽ More

    Submitted 5 June, 2025; originally announced June 2025.

    Comments: 17 pages, 4 figures

  2. arXiv:2402.11650  [pdf, other

    cs.LG cs.LO cs.PL

    Programmatic Reinforcement Learning: Navigating Gridworlds

    Authors: Guruprerana Shabadi, Nathanaël Fijalkow, Théo Matricon

    Abstract: The field of reinforcement learning (RL) is concerned with algorithms for learning optimal policies in unknown stochastic environments. Programmatic RL studies representations of policies as programs, meaning involving higher order constructs such as control loops. Despite attracting a lot of attention at the intersection of the machine learning and formal methods communities, very little is known… ▽ More

    Submitted 10 January, 2025; v1 submitted 18 February, 2024; originally announced February 2024.

    Comments: Published in the proceedings of GenPlan, AAAI 2025 Workshop on Generlization in Planning