Showing 1–7 of 7 results for author: Andrist, S

  1. arXiv:2409.10525

    cs.MM cs.AI cs.CL

    "Is This It?": Towards Ecologically Valid Benchmarks for Situated Collaboration

    Authors: Dan Bohus, Sean Andrist, Yuwei Bao, Eric Horvitz, Ann Paradiso

    Abstract: We report initial work towards constructing ecologically valid benchmarks to assess the capabilities of large multimodal models for engaging in situated collaboration. In contrast to existing benchmarks, in which question-answer pairs are generated post hoc over preexisting or synthetic datasets via templates, human annotators, or large language models (LLMs), we propose and investigate an interac…

    Submitted 30 August, 2024; originally announced September 2024.

  2. arXiv:2405.13035

    cs.HC cs.AI

    SIGMA: An Open-Source Interactive System for Mixed-Reality Task Assistance Research

    Authors: Dan Bohus, Sean Andrist, Nick Saw, Ann Paradiso, Ishani Chakraborty, Mahdi Rad

    Abstract: We introduce an open-source system called SIGMA (short for "Situated Interactive Guidance, Monitoring, and Assistance") as a platform for conducting research on task-assistive agents in mixed-reality scenarios. The system leverages the sensing and rendering affordances of a head-mounted mixed-reality device in conjunction with large language and vision models to guide users step by step through pr…

    Submitted 16 May, 2024; originally announced May 2024.

    Comments: 10 pages, 5 figures

  3. arXiv:2309.17024

    cs.CV

    HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World

    Authors: Xin Wang, Taein Kwon, Mahdi Rad, Bowen Pan, Ishani Chakraborty, Sean Andrist, Dan Bohus, Ashley Feniello, Bugra Tekin, Felipe Vieira Frujeri, Neel Joshi, Marc Pollefeys

    Abstract: Building an interactive AI assistant that can perceive, reason, and collaborate with humans in the real world has been a long-standing pursuit in the AI community. This work is part of a broader research effort to develop intelligent agents that can interactively guide humans through performing tasks in the physical world. As a first step in this direction, we introduce HoloAssist, a large-scale e…

    Submitted 29 September, 2023; originally announced September 2023.

    Comments: ICCV 2023

  4. arXiv:2103.15975

    cs.AI

    Platform for Situated Intelligence

    Authors: Dan Bohus, Sean Andrist, Ashley Feniello, Nick Saw, Mihai Jalobeanu, Patrick Sweeney, Anne Loomis Thompson, Eric Horvitz

    Abstract: We introduce Platform for Situated Intelligence, an open-source framework created to support the rapid development and study of multimodal, integrative-AI systems. The framework provides infrastructure for sensing, fusing, and making inferences from temporal streams of data across different modalities, a set of tools that enable visualization and debugging, and an ecosystem of components that enca…

    Submitted 29 March, 2021; originally announced March 2021.

    Comments: 29 pages, 14 figures, Microsoft Research Technical Report

    Report number: MSR-TR-2021-02

  5. arXiv:2010.06084

    cs.AI cs.RO

    Accelerating the Development of Multimodal, Integrative-AI Systems with Platform for Situated Intelligence

    Authors: Sean Andrist, Dan Bohus

    Abstract: We describe Platform for Situated Intelligence, an open-source framework for multimodal, integrative-AI systems. The framework provides infrastructure, tools, and components that enable and accelerate the development of applications that process multimodal streams of data and in which timing is critical. The framework is particularly well-suited for developing physically situated interactive syste…

    Submitted 12 October, 2020; originally announced October 2020.

    Comments: 5 pages, 1 figure. Submitted to the 2020 AAAI Fall Symposium: Trust and Explainability in Artificial Intelligence for Human-Robot Interaction

  6. arXiv:2008.07668

    cs.RO cs.CV cs.HC

    REFORM: Recognizing F-formations for Social Robots

    Authors: Hooman Hedayati, Annika Muehlbradt, Daniel J. Szafir, Sean Andrist

    Abstract: Recognizing and understanding conversational groups, or F-formations, is a critical task for situated agents designed to interact with humans. F-formations contain complex structures and dynamics, yet are used intuitively by people in everyday face-to-face conversations. Prior research exploring ways of identifying F-formations has largely relied on heuristic algorithms that may not capture the ri…

    Submitted 17 August, 2020; originally announced August 2020.

    Comments: IROS 2020

  7. arXiv:1905.05179

    cs.LG cs.AI cs.SE stat.ML

    Metareasoning in Modular Software Systems: On-the-Fly Configuration using Reinforcement Learning with Rich Contextual Representations

    Authors: Aditya Modi, Debadeepta Dey, Alekh Agarwal, Adith Swaminathan, Besmira Nushi, Sean Andrist, Eric Horvitz

    Abstract: Assemblies of modular subsystems are being pressed into service to perform sensing, reasoning, and decision making in high-stakes, time-critical tasks in such areas as transportation, healthcare, and industrial automation. We address the opportunity to maximize the utility of an overall computing system by employing reinforcement learning to guide the configuration of the set of interacting module…

    Submitted 12 May, 2019; originally announced May 2019.

    Comments: 12 pages, 7 figures, 2 tables