Showing 1–2 of 2 results for author: Nezami, F N

Searching in archive cs.

  1. arXiv:2502.03214  [pdf, other]

    cs.CL cs.AI cs.CV

    iVISPAR -- An Interactive Visual-Spatial Reasoning Benchmark for VLMs

    Authors: Julius Mayer, Mohamad Ballout, Serwan Jassim, Farbod Nosrat Nezami, Elia Bruni

    Abstract: Vision-Language Models (VLMs) are known to struggle with spatial reasoning and visual alignment. To help overcome these limitations, we introduce iVISPAR, an interactive multi-modal benchmark designed to evaluate the spatial reasoning capabilities of VLMs acting as agents. iVISPAR is based on a variant of the sliding tile puzzle, a classic problem that demands logical planning, spatial awareness, a…

    Submitted 5 February, 2025; originally announced February 2025.

  2. arXiv:2012.12041  [pdf]

    cs.HC

    WestDrive X LoopAR: An open-access virtual reality project in Unity for evaluating user interaction methods during TOR

    Authors: Farbod N. Nezami, Maximilian A. Wächter, Nora Maleki, Philipp Spaniol, Lea M. Kühne, Anke Haas, Johannes M. Pingel, Linus Tiemann, Frederik Nienhaus, Lynn Keller, Sabine König, Peter König, Gordon Pipa

    Abstract: With the further development of highly automated vehicles, drivers will engage in non-related tasks while being driven. Still, drivers have to take over control when requested by the car. Here the question arises, how potentially distracted drivers get back into the control-loop quickly and safely when the car requests a takeover. To investigate effective human-machine interactions in mobile, vers…

    Submitted 22 December, 2020; originally announced December 2020.