Skip to main content

Showing 1–2 of 2 results for author: Kalsi, M

Searching in archive cs. Search in all archives.
.
  1. arXiv:2503.15661  [pdf, other

    cs.CV cs.AI cs.CL

    UI-Vision: A Desktop-centric GUI Benchmark for Visual Perception and Interaction

    Authors: Shravan Nayak, Xiangru Jian, Kevin Qinghong Lin, Juan A. Rodriguez, Montek Kalsi, Rabiul Awal, Nicolas Chapados, M. Tamer Özsu, Aishwarya Agrawal, David Vazquez, Christopher Pal, Perouz Taslakian, Spandana Gella, Sai Rajeswar

    Abstract: Autonomous agents that navigate Graphical User Interfaces (GUIs) to automate tasks like document editing and file management can greatly enhance computer workflows. While existing research focuses on online settings, desktop environments, critical for many professional and everyday tasks, remain underexplored due to data collection challenges and licensing issues. We introduce UI-Vision, the first… ▽ More

    Submitted 6 May, 2025; v1 submitted 19 March, 2025; originally announced March 2025.

    Comments: This paper has been accepted to the 41st International Conference on Machine Learning (ICML 2025)

  2. arXiv:2403.12309  [pdf, other

    cs.LG cs.AI

    Reinforcement Learning from Delayed Observations via World Models

    Authors: Armin Karamzade, Kyungmin Kim, Montek Kalsi, Roy Fox

    Abstract: In standard reinforcement learning settings, agents typically assume immediate feedback about the effects of their actions after taking them. However, in practice, this assumption may not hold true due to physical constraints and can significantly impact the performance of learning algorithms. In this paper, we address observation delays in partially observable environments. We propose leveraging… ▽ More

    Submitted 25 June, 2024; v1 submitted 18 March, 2024; originally announced March 2024.