Showing 1–15 of 15 results for author: Dibia, V

  1. arXiv:2503.02068  [pdf, other]

    cs.MA cs.AI cs.HC

    Interactive Debugging and Steering of Multi-Agent AI Systems

    Authors: Will Epperson, Gagan Bansal, Victor Dibia, Adam Fourney, Jack Gerrits, Erkang Zhu, Saleema Amershi

    Abstract: Fully autonomous teams of LLM-powered AI agents are emerging that collaborate to perform complex tasks for users. What challenges do developers face when trying to build and debug these AI agent teams? In formative interviews with five AI agent developers, we identify core challenges: difficulty reviewing long agent conversations to localize errors, lack of support in current tools for interactive…

    Submitted 3 March, 2025; originally announced March 2025.

    Comments: Published at CHI 25

  2. arXiv:2412.10380  [pdf, other]

    cs.HC cs.AI

    Challenges in Human-Agent Communication

    Authors: Gagan Bansal, Jennifer Wortman Vaughan, Saleema Amershi, Eric Horvitz, Adam Fourney, Hussein Mozannar, Victor Dibia, Daniel S. Weld

    Abstract: Remarkable advancements in modern generative foundation models have enabled the development of sophisticated and highly capable autonomous agents that can observe their environment, invoke tools, and communicate with other agents to solve problems. Although such agents can communicate with users through natural language, their complexity and wide-ranging failure modes present novel challenges for…

    Submitted 27 November, 2024; originally announced December 2024.

  3. arXiv:2411.04468  [pdf, other]

    cs.AI cs.MA

    Magentic-One: A Generalist Multi-Agent System for Solving Complex Tasks

    Authors: Adam Fourney, Gagan Bansal, Hussein Mozannar, Cheng Tan, Eduardo Salinas, Erkang Zhu, Friederike Niedtner, Grace Proebsting, Griffin Bassman, Jack Gerrits, Jacob Alber, Peter Chang, Ricky Loynd, Robert West, Victor Dibia, Ahmed Awadallah, Ece Kamar, Rafah Hosn, Saleema Amershi

    Abstract: Modern AI agents, driven by advances in large foundation models, promise to enhance our productivity and transform our lives by augmenting our knowledge and capabilities. To achieve this vision, AI agents must effectively plan, perform multi-step reasoning and actions, respond to novel observations, and recover from errors, to successfully complete complex tasks across a wide range of scenarios. I…

    Submitted 7 November, 2024; originally announced November 2024.

  4. arXiv:2409.18475  [pdf, other]

    cs.AI cs.HC

    Data Analysis in the Era of Generative AI

    Authors: Jeevana Priya Inala, Chenglong Wang, Steven Drucker, Gonzalo Ramos, Victor Dibia, Nathalie Riche, Dave Brown, Dan Marshall, Jianfeng Gao

    Abstract: This paper explores the potential of AI-powered tools to reshape data analysis, focusing on design considerations and challenges. We explore how the emergence of large language and multimodal models offers new opportunities to enhance various stages of data analysis workflow by translating high-level user intentions into executable code, charts, and insights. We then examine human-centered design…

    Submitted 27 September, 2024; originally announced September 2024.

  5. arXiv:2408.15247  [pdf, other]

    cs.SE cs.AI cs.CL cs.HC cs.LG

    AutoGen Studio: A No-Code Developer Tool for Building and Debugging Multi-Agent Systems

    Authors: Victor Dibia, Jingya Chen, Gagan Bansal, Suff Syed, Adam Fourney, Erkang Zhu, Chi Wang, Saleema Amershi

    Abstract: Multi-agent systems, where multiple agents (generative AI models + tools) collaborate, are emerging as an effective pattern for solving long-running, complex tasks in numerous domains. However, specifying their parameters (such as models, tools, and orchestration mechanisms, etc.) and debugging them remains challenging for most developers. To address this challenge, we present AUTOGEN STUDIO, a no…

    Submitted 8 August, 2024; originally announced August 2024.

    Comments: 8 pages

  6. arXiv:2408.09365  [pdf, other]

    cs.AI cs.CL

    Concept Distillation from Strong to Weak Models via Hypotheses-to-Theories Prompting

    Authors: Emmanuel Aboah Boateng, Cassiano O. Becker, Nabiha Asghar, Kabir Walia, Ashwin Srinivasan, Ehi Nosakhare, Soundar Srinivasan, Victor Dibia

    Abstract: Hand-crafting high quality prompts to optimize the performance of language models is a complicated and labor-intensive process. Furthermore, when migrating to newer, smaller, or weaker models (possibly due to latency or cost gains), prompts need to be updated to re-optimize the task performance. We propose Concept Distillation (CD), an automatic prompt optimization technique for enhancing weaker m…

    Submitted 22 February, 2025; v1 submitted 18 August, 2024; originally announced August 2024.

    Comments: Accepted to NAACL 2025; 17 pages, 8 figures

  7. arXiv:2402.09015  [pdf, other]

    cs.CL cs.AI

    Towards better Human-Agent Alignment: Assessing Task Utility in LLM-Powered Applications

    Authors: Negar Arabzadeh, Julia Kiseleva, Qingyun Wu, Chi Wang, Ahmed Awadallah, Victor Dibia, Adam Fourney, Charles Clarke

    Abstract: The rapid development in the field of Large Language Models (LLMs) has led to a surge in applications that facilitate collaboration among multiple agents to assist humans in their daily tasks. However, a significant gap remains in assessing whether LLM-powered applications genuinely enhance user experience and task execution efficiency. This highlights the pressing need for methods to verify utili…

    Submitted 22 February, 2024; v1 submitted 14 February, 2024; originally announced February 2024.

  8. arXiv:2402.07104  [pdf]

    cs.HC

    The Aleph & Other Metaphors for Image Generation

    Authors: Gonzalo Ramos, Rick Barraza, Victor Dibia, Sharon Lo

    Abstract: In this position paper, we reflect on fictional stories dealing with the infinite and how they connect with the current, fast-evolving field of image generation models. We draw attention to how some of these literary constructs can serve as powerful metaphors for guiding human-centered design and technical thinking in the space of these emerging technologies and the experiences we build around the…

    Submitted 11 February, 2024; originally announced February 2024.

  9. arXiv:2312.02206  [pdf, other]

    cs.AI cs.CL

    Axiomatic Preference Modeling for Longform Question Answering

    Authors: Corby Rosset, Guoqing Zheng, Victor Dibia, Ahmed Awadallah, Paul Bennett

    Abstract: The remarkable abilities of large language models (LLMs) like GPT-4 partially stem from post-training processes like Reinforcement Learning from Human Feedback (RLHF) involving human preferences encoded in a reward model. However, these reward models (RMs) often lack direct knowledge of why, or under what principles, the preference annotations were made. In this study, we identify principles that…

    Submitted 2 December, 2023; originally announced December 2023.

    Comments: Accepted to EMNLP 2023

  10. arXiv:2303.02927  [pdf, other]

    cs.AI cs.HC cs.PL

    LIDA: A Tool for Automatic Generation of Grammar-Agnostic Visualizations and Infographics using Large Language Models

    Authors: Victor Dibia

    Abstract: Systems that support users in the automatic creation of visualizations must address several subtasks: understand the semantics of data, enumerate relevant visualization goals, and generate visualization specifications. In this work, we pose visualization generation as a multi-stage generation problem and argue that well-orchestrated pipelines based on large language models (LLMs) such as ChatGPT/G…

    Submitted 5 June, 2023; v1 submitted 6 March, 2023; originally announced March 2023.

    Comments: Accepted at ACL 2023 (Demonstration track). Fix formatting issues, update information on evaluation metrics, prompts and project website (https://microsoft.github.io/lida/)

  11. arXiv:2210.16494  [pdf, other]

    cs.SE cs.AI cs.HC cs.PL

    Aligning Offline Metrics and Human Judgments of Value for Code Generation Models

    Authors: Victor Dibia, Adam Fourney, Gagan Bansal, Forough Poursabzi-Sangdeh, Han Liu, Saleema Amershi

    Abstract: Large language models have demonstrated great potential to assist programmers in generating code. For such human-AI pair programming scenarios, we empirically demonstrate that while generated code is most often evaluated in terms of its functional correctness (i.e., whether generations pass available unit tests), correctness does not fully capture (e.g., may underestimate) the productivity gains…

    Submitted 13 June, 2023; v1 submitted 29 October, 2022; originally announced October 2022.

    Comments: Accepted at ACL 2023 (Findings)

  12. arXiv:2007.15211  [pdf, other]

    cs.CL cs.AI cs.HC cs.IR

    NeuralQA: A Usable Library for Question Answering (Contextual Query Expansion + BERT) on Large Datasets

    Authors: Victor Dibia

    Abstract: Existing tools for Question Answering (QA) have challenges that limit their use in practice. They can be complex to set up or integrate with existing infrastructure, do not offer configurable interactive interfaces, and do not cover the full set of subtasks that frequently comprise the QA pipeline (query expansion, retrieval, reading, and explanation/sensemaking). To help address these issues, we…

    Submitted 27 November, 2020; v1 submitted 29 July, 2020; originally announced July 2020.

    Comments: Published in the Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations (EMNLP 2020), Demo track. 8 pages, 4 figures

  13. arXiv:1807.06641  [pdf, other]

    cs.HC

    Beyond Heuristics: Learning Visualization Design

    Authors: Bahador Saket, Dominik Moritz, Halden Lin, Victor Dibia, Cagatay Demiralp, Jeffrey Heer

    Abstract: In this paper, we describe a research agenda for deriving design principles directly from data. We argue that it is time to go beyond manually curated and applied visualization design guidelines. We propose learning models of visualization design from data collected using graphical perception studies and build tools powered by the learned models. To achieve this vision, we need to 1) develop scala…

    Submitted 15 August, 2018; v1 submitted 17 July, 2018; originally announced July 2018.

  14. arXiv:1805.10723  [pdf, other]

    cs.HC cs.AI

    Designing for Democratization: Introducing Novices to Artificial Intelligence Via Maker Kits

    Authors: Victor Dibia, Aaron Cox, Justin Weisz

    Abstract: Existing research highlights the myriad benefits realized when technology is sufficiently democratized and made accessible to non-technical or novice users. However, democratizing complex technologies such as artificial intelligence (AI) remains hard. In this work, we draw on theoretical underpinnings from the democratization of innovation, in exploring the design of maker kits that help introdu…

    Submitted 5 January, 2019; v1 submitted 27 May, 2018; originally announced May 2018.

    Comments: Early paper draft - Updated references, author list, figure captions, acknowledgements

  15. arXiv:1804.03126  [pdf, other]

    cs.HC cs.AI cs.LG

    Data2Vis: Automatic Generation of Data Visualizations Using Sequence to Sequence Recurrent Neural Networks

    Authors: Victor Dibia, Çağatay Demiralp

    Abstract: Rapidly creating effective visualizations using expressive grammars is challenging for users who have limited time and limited skills in statistics and data visualization. Even high-level, dedicated visualization tools often require users to manually select among data attributes, decide which transformations to apply, and specify mappings between visual encoding variables and raw or transformed at…

    Submitted 2 November, 2018; v1 submitted 9 April, 2018; originally announced April 2018.

    Comments: IEEE VDS'18