Showing 1–2 of 2 results for author: Minkova, L

  1. arXiv:2412.03446  [pdf, other]

    cs.AI

    From Words to Workflows: Automating Business Processes

    Authors: Laura Minkova, Jessica López Espejel, Taki Eddine Toufik Djaidja, Walid Dahhane, El Hassane Ettifouri

    Abstract: As businesses increasingly rely on automation to streamline operations, the limitations of Robotic Process Automation (RPA) have become apparent, particularly its dependence on expert knowledge and inability to handle complex decision-making tasks. Recent advancements in Artificial Intelligence (AI), particularly Generative AI (GenAI) and Large Language Models (LLMs), have paved the way for Intell…

    Submitted 4 December, 2024; originally announced December 2024.

    Comments: Under review at Elsevier's Engineering Applications of Artificial Intelligence

  2. arXiv:2407.01558  [pdf, other]

    cs.HC cs.AI

    Visual grounding for desktop graphical user interfaces

    Authors: Tassnim Dardouri, Laura Minkova, Jessica López Espejel, Walid Dahhane, El Hassane Ettifouri

    Abstract: Most instance perception and image understanding solutions focus mainly on natural images. However, applications for synthetic images, and more specifically, images of Graphical User Interfaces (GUI) remain limited. This hinders the development of autonomous computer-vision-powered Artificial Intelligence (AI) agents. In this work, we present Instruction Visual Grounding or IVG, a multi-modal solu…

    Submitted 17 September, 2024; v1 submitted 5 May, 2024; originally announced July 2024.

    Comments: Preprint submitted to Computer Vision and Image Understanding journal