Showing 1–4 of 4 results for author: Dirac, L

Searching in archive cs.
  1. arXiv:2506.14821

    cs.LG cs.AI cs.CV

    Reinforcing VLMs to Use Tools for Detailed Visual Reasoning Under Resource Constraints

    Authors: Sunil Kumar, Bowen Zhao, Leo Dirac, Paulina Varshavskaya

    Abstract: Despite tremendous recent advances in large model reasoning ability, vision-language models (VLMs) still struggle with detailed visual reasoning, especially when compute resources are limited. To address this challenge, we draw inspiration from methods like Deepseek-r1 for VLMs and train smaller-scale models with Group Relative Policy Optimization (GRPO) to use external tools such as zoom. The gre…

    Submitted 10 June, 2025; originally announced June 2025.

  2. arXiv:2409.17080

    cs.CV cs.CL

    Can Vision Language Models Learn from Visual Demonstrations of Ambiguous Spatial Reasoning?

    Authors: Bowen Zhao, Leo Parker Dirac, Paulina Varshavskaya

    Abstract: Large vision-language models (VLMs) have become state-of-the-art for many computer vision tasks, with in-context learning (ICL) as a popular adaptation strategy for new ones. But can VLMs learn novel concepts purely from visual demonstrations, or are they limited to adapting to the output format of ICL examples? We propose a new benchmark we call Spatial Visual Ambiguity Tasks (SVAT) that challeng…

    Submitted 25 September, 2024; originally announced September 2024.

    Comments: 13 pages, 4 figures. Code released at https://github.com/groundlight/vlm-visual-demonstrations

  3. arXiv:2012.08483

    cs.LG

    Amazon SageMaker Autopilot: a white box AutoML solution at scale

    Authors: Piali Das, Valerio Perrone, Nikita Ivkin, Tanya Bansal, Zohar Karnin, Huibin Shen, Iaroslav Shcherbatyi, Yotam Elor, Wilton Wu, Aida Zolic, Thibaut Lienart, Alex Tang, Amr Ahmed, Jean Baptiste Faddoul, Rodolphe Jenatton, Fela Winkelmolen, Philip Gautier, Leo Dirac, Andre Perunicic, Miroslav Miladinovic, Giovanni Zappella, Cédric Archambeau, Matthias Seeger, Bhaskar Dutt, Laurence Rouesnel

    Abstract: AutoML systems provide a black-box solution to machine learning problems by selecting the right way of processing features, choosing an algorithm and tuning the hyperparameters of the entire pipeline. Although these systems perform well on many datasets, there is still a non-negligible number of datasets for which the one-shot solution produced by each particular system would provide sub-par perfo…

    Submitted 16 December, 2020; v1 submitted 15 December, 2020; originally announced December 2020.

  4. arXiv:1911.01562

    cs.LG cs.AI cs.RO

    DeepRacer: Educational Autonomous Racing Platform for Experimentation with Sim2Real Reinforcement Learning

    Authors: Bharathan Balaji, Sunil Mallya, Sahika Genc, Saurabh Gupta, Leo Dirac, Vineet Khare, Gourav Roy, Tao Sun, Yunzhe Tao, Brian Townsend, Eddie Calleja, Sunil Muralidhara, Dhanasekar Karuppasamy

    Abstract: DeepRacer is a platform for end-to-end experimentation with RL and can be used to systematically investigate the key challenges in developing intelligent control systems. Using the platform, we demonstrate how a 1/18th scale car can learn to drive autonomously using RL with a monocular camera. It is trained in simulation with no additional tuning in physical world and demonstrates: 1) formulation…

    Submitted 4 November, 2019; originally announced November 2019.