Skip to main content

Showing 1–4 of 4 results for author: Loula, J

.
  1. arXiv:2504.13139  [pdf, other

    cs.CL cs.AI cs.LG

    Syntactic and Semantic Control of Large Language Models via Sequential Monte Carlo

    Authors: João Loula, Benjamin LeBrun, Li Du, Ben Lipkin, Clemente Pasti, Gabriel Grand, Tianyu Liu, Yahya Emara, Marjorie Freedman, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Alexander K. Lew, Tim Vieira, Timothy J. O'Donnell

    Abstract: A wide range of LM applications require generating text that conforms to syntactic or semantic constraints. Imposing such constraints can be naturally framed as probabilistic conditioning, but exact generation from the resulting distribution -- which can differ substantially from the LM's base distribution -- is generally intractable. In this work, we develop an architecture for controlled LM gene… ▽ More

    Submitted 18 April, 2025; v1 submitted 17 April, 2025; originally announced April 2025.

    Comments: 34 pages, 4 figures

  2. arXiv:2504.05410  [pdf, other

    cs.CL cs.AI cs.LG

    Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling

    Authors: Benjamin Lipkin, Benjamin LeBrun, Jacob Hoover Vigly, João Loula, David R. MacIver, Li Du, Jason Eisner, Ryan Cotterell, Vikash Mansinghka, Timothy J. O'Donnell, Alexander K. Lew, Tim Vieira

    Abstract: The dominant approach to generating from language models subject to some constraint is locally constrained decoding (LCD), incrementally sampling tokens at each time step such that the constraint is never violated. Typically, this is achieved through token masking: looping over the vocabulary and excluding non-conforming tokens. There are two important problems with this approach. (i) Evaluating t… ▽ More

    Submitted 7 April, 2025; originally announced April 2025.

  3. arXiv:2107.12544  [pdf, other

    cs.AI

    Human-Level Reinforcement Learning through Theory-Based Modeling, Exploration, and Planning

    Authors: Pedro A. Tsividis, Joao Loula, Jake Burga, Nathan Foss, Andres Campero, Thomas Pouncy, Samuel J. Gershman, Joshua B. Tenenbaum

    Abstract: Reinforcement learning (RL) studies how an agent comes to achieve reward in an environment through interactions over time. Recent advances in machine RL have surpassed human expertise at the world's oldest board games and many classic video games, but they require vast quantities of experience to learn successfully -- none of today's algorithms account for the human ability to learn so many differ… ▽ More

    Submitted 26 July, 2021; originally announced July 2021.

  4. arXiv:1807.07545  [pdf, other

    cs.CL cs.AI cs.LG

    Rearranging the Familiar: Testing Compositional Generalization in Recurrent Networks

    Authors: João Loula, Marco Baroni, Brenden M. Lake

    Abstract: Systematic compositionality is the ability to recombine meaningful units with regular and predictable outcomes, and it's seen as key to humans' capacity for generalization in language. Recent work has studied systematic compositionality in modern seq2seq models using generalization to novel navigation instructions in a grounded environment as a probing tool, requiring models to quickly bootstrap t… ▽ More

    Submitted 19 July, 2018; originally announced July 2018.